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A STUDY OF THE TRANSPOSITION GRADIENT? 


DAVID EHRENFREUND 
The State College of Washington 


Despite the mass of data collected 
on the white rat, a recent study by 
Kendler (12) represents the only re- 
port to date in which the transposition 
phenomenon has been systematically 
investigated with this S. This pau- 
city of data perhaps accounts for 
Munn’s omission of the theoretical 
implications of transposition in his 
chapter on systematic psychology. 


.In discussing the data, however, he 


recognizes its theoretical import and 
accepts unquestioningly the gestalt 
notion that the response, initially at 
least, is to a relationship (16, pp. 131, 
143, 178). Other recent publications 
(1, 5, 11, 15) not restricted to rat data 
nonetheless arrive at the same con- 
clusion despite evidence that should 
lead at least to a questioning of the 
adequacy of either the traditional or 
neo-gestalt viewpoint (3, 10, 13, 14, 
18, 19,20). Hebb (5), however, takes 
the position that although the re- 
sponse is to a relationship, this does 
not support the gestalt position.” 


1 This investigation was supported in part by 
The State College of Washington Research Fund. 
The writer wishes to thank Mr. Garry Margolius, 
research assistant, for collecting part of the data. 

2 Others such as Hilgard (6) confuse the issue 
of transposition with the related one of continu- 
ity by discussing data relevant to the latter to 
evaluate the former. This technique is all the 


The above references (1, 5, 11, 15) 
have restricted themselves almost ex- 
clusively to the transposition experi- 
ment which limits its test to but one 
step removed from the training pair, 
e.g., Ss are first trained on medium 
gray positive versus light gray nega- 
tive and then tested on a medium gray 
opposed to a dark gray. Under cer- 
tain conditions, the responses on such 
atest are tothe dark gray. Since the 
Ss were originally trained on the 
“darker than” and respond likewise 
in the test situation one might assume 
that the original discrimination was 
learned on the basis of responding to a 
relationship or configuration rather 
than to the absolute properties of each 
cue. Such assumptions, however, are 
neither logically demanded nor neces- 
sarily the most appropriate ones. 

In their well-developed gestalt the- 
ory, Gulliksen and Wolfle discuss the 
illegitimacy of the relational assump- 
tion: 

Transfer on a relational basis is . . . in- 
terpreted as evidence that learning was on 

a relational basis. It has now been dem- 


onstrated that relative transfer can be pre- 
dicted from a theory based on a directional 


more surprising in view of the more adequate 
treatment contained in the earlier Hilgard and 
Marquis publication (7). 
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definition of the response. Since this pre- 
diction is possible, it is not legitimate to use 
the fact of relational transfer to support the 
assumption that the animal’s original learn- 
ing was relational in nature (4, p. 240). 


Their formulation, however, does not 
account for instances of either choice 
scores or reversals in the test (trans- 
fer) situation. 

A theory more capable in this re- 
spect is that proposed by Spence (16, 
17, 18, 19). 


His theory, unlike the various Gestalt 
interpretations, does not assume that the 
animal is responding to some abstract rela- 
tional property, or to a primitive structure- 
process. He holds, on the contrary, that 
the positive member in the stimulus-series 
acquires excitatory properties as a result of 
training to the response of approaching it 
and, furthermore, that there is a gradient of 
generalization of this excitation to other 
members of the stimulus-series, including 
the negative stimulus-object. With failure 
of reenforcement of response to the latter, 
however, an inhibitory tendency develops 
which, in turn, is assumed to generalize to 
adjoining members of the stimulus-series 


(19, p. 223). 


The theoretical picture presented 
by Spence has successfully predicted 
not only the success or failure of trans- 
position when tests are conducted on 
logarithmically equal whole-step stim- 
ulus pairs but also generates testable 
hypotheses concerning probability of 
transposition when the tests are not 
so restricted. In other words, the 
theory does not demand that the dif- 
ference between the members of the 
test pair be equal to the difference 
employed in the original discrimina- 
tion. Nor are the tests necessarily 
limited to such pairs as are whole-step 
units removed from the training pair. 
Employing a stimulus series that 
varies in a single dimension, the theory 
should predict the relative amount of 
transposition on any pair after train- 
ing on any other pair. Theoretical 
confirmation from tests employing 


partial steps has been reported by 
Gulliksen (3). The prediction that 
transposition tends to decrease as 
the difference between the training 
and test pairs increases in whole-step 
units has also been confirmed (3, 12, 
13, 14, 18, 19). However, the nature 
of the function combining these two 
situations has received little experi- 
mental attention. 

In the present investigation, these 
two situations were combined by first 
training a large group of Ss on a 
brightness discrimination in which the 
difference between the members was 
fairly large. Upon reaching the cri- 
terion of learning each S was tested on 
one of four new pairs of stimuli differ- 
ing along a brightness dimension at 
various partial steps distant from the 
original. 


METHOD 


Subjects —The Ss were 80 albino male rats 
from the colony maintained by the psychology 
department of the State College of Washington. 

Apparatus—The apparatus was a simple 
T-maze design, the stem of which was walled 
and covered with hardware cloth. The arms 
were open as in an elevated maze. The floor 
consisted of a grid of 4-in. wire spaced $ in. apart. 
Each arm, from the center of the choice point, 
could accommodate a 4 X 10-in. stimulus card. 
At the end of each runway, a glass food cup 
(furniture coaster) could be fastened to a two- 
pronged wire holder in such a way as to be in- 
visible from the choice point. A T-shaped 
aluminum door at the choice point prevented 
both retracing and correction. The sole illumi- 
nation was provided by a 20-w. fluorescent lamp 
shielded by two thicknesses of transluscent 
tracing paper. The stimuli for the brightness 
continuum consisted of eight 4 X 10-in. sheets of 
060-gauge cellulose acetate plastic. These were 
purchased from the Container Corporation of 
America and are identical to the brightness 
series employed in the second edition of the 
Color Harmony Manual.? The main advan- 
tages of this material lie in its rigidity, homogene- 


3 Acknowledgment is extended to the helpful 
cooperation of Mr. Walter C. Granville and Mr. 
Carl E. Foss, editors of the Color Harmony 
Manual, for making available the stimuli em- 
ployed in the present investigation. 
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TABLE 1 


Tratninc Pairs anp Various Test Pairs 
(A to D) Usep in Born Experiments 




















Stimulus Stimulus Cards Stimulus 
Pairs Diffuse Reflection | Ratio 
Exp. I Factors Exp. II 
Training | (4)* .22 (1) .91 | 4.1:1 D 
pair 
A (5) .13 (2) .57 | 4.4:1 Cc 
B (6) .09 (3) .34]3.8:1 B 
Cc (7) .O5 (4) .22 | 4.4:1 A 
D (8) .035 (5) .13 | 3.7:1 | Training 
pair 














* These numbers refer to stimulus cards. 


ous matt surface, and ability to withstand re- 
peated use and washing. The stimuli were cali- 
brated by the Container Corporation as having 
the following diffuse reflection factors on their 
matt (painted) side: .91, .57, .34, .22, .13, .09, 
.05,.035. In the present discussion these stimuli 
will be referred to simply by number from 1 
(white) to 8 (black). 

Procedure—After Ss were tamed and had 
experienced at least a week of being fed an 
8-gm. daily ration of laboratory chow, they were 
introduced to the apparatus. On the first two 
days, each S was allowed to consume ten pellets 
of calf manna on each arm of the maze. During 
this time, the door was closed. On the next two 
days, each S was allowed ten runs from the 
starting box to the food cup where it consumed 
one pellet of food. These runs were forced left 
and right in a balanced random order. Learning 
of the original discrimination was then instituted. 
The appropriate stimulus cards were introduced 
for the first time and each S was given ten non- 
correction trials per day until the criterion of 18 
out of 20 errorless trials, with the last ten error- 
less, was attained. During the training series, 
the food cup was placed only on the rewarded 
side. In Exp. I, Ss were trained on Stimulus 
1 (—) vs. Stimulus 4 (+). In Exp. II, they 
were trained on Stimulus 8 (—) vs. Stimulus 
5 (+). Each S was then tested for two days 
(20 trials) on a new pair A, B, C, or D as indi- 
cated in Table 1. 

On the test trials, all responses were rewarded. 
Thus, in Exp. I, Ss were trained to the darker 
stimulus and transposed down the scale. In 
Exp. II, Ss were trained to the lighter stimulus 
and transposed up the scale. All transposition 
(test) scores are in terms of whether S responded 
in a manner consistent with the direction em- 
ployed in training, i.e., to the darker or to the 
lighter. Ten Ss were tested on each condition 
of each experiment. 


RESULTS 


Trial and error scores for the ori- 
ginal discrimination were fairly low. 
In Exp. I, in which Ss were trained to 
discriminate Stimulus 1 (non-re- 
warded) from Stimulus 4 (rewarded), 
the mean trials and errors were 86.9 
and 42.6. In Exp. II (Stimulus 8 
non-rewarded, Stimulus 5 rewarded), 
the mean trials and errors were 75.5, 
and 34.2. 

The test trials for each test pair in 
both experiments were subdivided in- 
to successive five-trial blocks. The 
mean transposition scores obtained 
on these blocks of trials, as well as the 
total for all 20 trials, are presented in 
Table 2. The trends revealed in this 
table can be analyzed in terms of both 
columns and rows. Inspection of the 
rows, i.e., the subdivided test scores 
for any one test pair, reveals no con- 
sistent trend that will describe all of 
the tests. The scores for Test Pair 
A of Exp. I rise steadily from 86 to 94. 
On the other hand, the scores for Test 
Pair D of the same experiment fall 


TABLE 2 


Per Cent or TRANSPOSITION 
By Five-Triat Biocxs 








Trials 
Tests 





1-5 6-10 | 11-15 16-20 | 1-20 











Exp. I Ss trained on Stimulus I (—) 
os. Stimulus 4 (+) 








Exp. II Ss trained on Stimulus 8 (—) 
os. Stimulus 5 (+) 




















oo 
aS 








100;- 
es 
© 90+ - 
= ee 
= sot ° \ 
fo} 
a 
© 70} 
< XY 
= 60}- \ 5 
™ 50 \ aan 
3 Ww \ i 
40+ e——e Exe! \ Pas 
w ” 
2 30 o-——o EXP II \ a 
« >~ 
cre) 
a 20b 
- A 8 c o 
° l l l } 





TEST GONDITIONS 


Fic. 1. Transposition gradients for Ss 
transposed down the brightness scale (Exp. 1), 
and for Ss transposed up the brightness scale 


(Exp. IT) 


steadily from 66 to 54. Scores for 
other test pairs show various fluctu- 
ations. Apparently any block of 
trials is as appropriate as any other 
as a measure of transposition. The 
various columns within each experi- 
ment, however, all reveal similar 
tendencies. 

The per cents of transposition, 
based on all 20 trials, are presented in 
Fig. 1. It will be recalled that in 
Exp. I, Ss were trained with the 
darker positive and tested on still 
darker pairs. Transposition is high 
on the near test and falls in an 
S-shaped fashion to a value below the 
50% point and then rises above the 
50% point. In Exp. II, Ss were 
trained on the lighter and tested on 
still lighter pairs. The data for Exp. 
II show the same general trends just 
described for Exp. I. A similar 
S-shaped tendency is evident except 
that the scores fall farther below the 
50% point on Test Pair C (significant 
at the 1% level), and Test Pair D is 
still below 50%. 


Discussion 


A recent study by T. S. Kendler 
(12) posed some interesting problems 


DAVID EHRENFREUND 


both in respect to the nature of the 
transposition function and the ade- 
quacy of Spence’s theory. The na- 
ture of the function was difficult to 
determine in the Kendler study due to 
the fact that on any one test pair, the 
later test trials produced consistently 
higher scores than the earlier test 
trials. Therefore, the data were 
plotted using three response meas- 
ures: Test Trials 1-5, 6-10, and 1-10 
(12, Fig. 3 and 4). Inspection of 
these curves reveals that not only are 
they different, but no matter which 
curve is employed as the more ap- 
propriate function, it only partially 
corroborates the expectations from 
Spence’s theory. Thus, although the 
amount of transposition decreased as 
a function of the difference between 
training and test stimuli, the test 
scores did not reach or go below the 
chance level as theoretically expected. 
According to the analysis made above 
for the present set of data, no such 
consistent trends between the sub- 
divided test scores for the same test 
stimuli were discovered. This find- 
ing, coupled with the fact that the 
transposition function is essentially 
similar no matter which block of test 
trials is employed, justifies acceptance 
of the curves in Fig. 1 as at least close 
approximations to the empirical law 
relating amount of transposition to 
the distance between training and 
test stimuli along a brightness dimen- 
sion. 


Although the Kendler study was similar in 
many respects to the present one, she used direct 
lighting, and several complications attendant 
upon the use of direct rather than reflected light 
could easily account for the differences between 
the findings of her study and the present one. 
Some, such as the problem of dark adaptation 
and the changing efficiency of the rat’s eye as 
trials continue, were discussed by Kendler. 
The possible necessity for positing “other factors 
in the present [Kendler] training or test situa- 
tions in addition to those posited by Spence to 
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determine the final results” (12, p. 561) was 
suggested in a footnote but was not elaborated. 
These “other factors,” which seem to the present 
author to be pertinent and peculiar to the direct 
illumination situation as employed by Kendler, 
include: 


1. The change in extra-maze cues when S is 
shifted from the training to the test pair of 
stimuli. The use of direct illumination in an 
otherwise non-illuminated room results in a 
change in room cues from few to many or vice 
versa depending on the direction of training and 
testing. This change would, of course, be 
greatest for those groups tested on the farthest 
pair and least for those tested on the nearest 
pair. 

2. Employing stimulus cues having drive 
value. The level of illumination employed on 
Test Pair 5 for Kendler’s Cond. 1, or for training 
in Cond. 2, was of a magnitude comparable to 
that employed in other studies as noxious stimuli 
(21). 


Thus, although the use of a situation employ- 
ing direct illumination may not be the most ap- 
propriate one with which to test the specific 
assumptions posited by Spence to account for 
transposition, the Kendler study suggests a de- 
gree of complexity perhaps not heretofore ap- 
preciated and which merits further study, par- 
ticularly in relation to attempts at more elabo- 
rate theoretical formulations. Furthermore, as 
Kendler points out, her results can be described 
within the framework of S-R reinforcement 
theory by employing, in addition to Spence’s 
postulates, Hull’s concepts of patterning of 
stimulus compounds (8) and his recently formu- 
lated stimulus dynamism (9). 


Turning now to the implications of 
the present set of data for Spence’s 
theory, we find a close approximation 
between prediction and results. Par- 
ticular note should be made of the 
fact that the low point on each curve 
occurred on Test Pair C which is only 
one step removed from the training 
pair. In Exp. I, Test Pair C was 
composed of Stimulus 4, the positive 
stimulus employed in the training 
situation, and a new Stimulus 7. In- 
stead of responding to Stimulus 7, as 
predicted by some form of relational 
theory, 56% of the responses were to 
the absolute stimulus, i.e., Stimulus 4. 
One might say that since the response 


was about chance, neither theoretical 
viewpoint was supported. According 
to CR theory, however, the amount 
of transposition will reach a chance 
level at some point, and under certain 
conditions will fail to occur, i.e., the 
response will be to the absolute stimu- 
lus (19,20). Apparently this is what 
happened in Exp. II, where Test Pair 
C was composed of Stimulus 5, the 
reinforced stimulus of the training 
situation, and a new Stimulus 2. In 
this instance, 75% of the responses 
were to the absolute stimulus. Fur- 
thermore, nine of the ten Ss responded 
better than 50% and three Ss re- 
sponded 100% to the absolute stimu- 
lus. 

Although both transposition curves 
are in line with the general prediction 
from the theory, the difference be- 
tween them may be important in 
terms of certain theoretical refine- 
ments. The generalization gradients 
of excitation and inhibition posited 
by Spence to account for transposition 
are symmetrical for both directions of 
the stimulus continuum (20). A re- 
cent study by Grice (2) indicates that 
for size, at least, such may not be the 
case. He found that the generaliza- 
tion gradient to the “larger” end of 
the continuum was of the same gen- 
eral shape as posited by Spence. 
However, toward the “smaller” end 
of the continuum, the gradient was 
discovered to be an exponential type 
not unlike that posited by Hull (8). 
There is also the question whether the 
generalization gradients for inhibition 
may be asymmetrical. Spence has 
suggested that “. . . the shape of the 
generalization curve [of inhibition ] in 
the direction of larger stimulus-areas 
differs from that of the smaller” (19, 
p. 229). 

If asymmetrical generalization gra- 
dients may be expected for the bright- 
ness dimension, as well as the size 
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dimension, then the difference be- 
tween the two transposition gradients 
of the present study conforms to that 
expectation. In Exp. II there was a 
greater failure of transposition than in 
Exp. I. In fact, these data are al- 
most exactly as predicted: 


That is, according to this schema, the per 
cent of responses consistent with the direc- 
tion of the original training should at first 
decrease until they drop to well below the 
50 per cent leve! (i.e., lead to a reversal of 
the originally learned preference) and sub- 
sequently rise and level off at a response 
value, presumably 50 per cent, where the 
difference between the theoretical effective 
excitatory strengths becomes zero (12, pp. 
560-561). 


Thus, it appears, from the present 
set of data, that Spence’s theoretical 
generalization gradients are more ap- 
propriate for predicting transposition 
up the brightness scale than down. 
In Kendler’s (12) study, the form of 
the curve was also more in line with 
the theory when Ss were transposed 
up the brightness scale rather than 
down. 


SUMMARY 


Two transposition experiments em- 
ploying reflected light were conducted. 
In Exp. I, 40 albino rats were trained 
to respond to the darker of two bright 
stimuli and tested at various partial 
steps down the brightness scale. In 
Exp. II, 40 albino rats were trained 
to respond to the brighter of two dark 
stimuli and tested at various partial 
steps up the brightness scale. In 
each experiment, four tests were 
carried out at points increasingly dis- 
tant from the training pair. Both 
transposition curves confirmed the 
general prediction from Spence’s the- 
ory to the effect that transposition is 
less perfect as the difference between 
training and test pairs is increased. 

The low point on each transposition 
gradient, which was either chance 
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(Exp. I) or below chance (Exp. II), 
occurred on Test C which was only 
one step removed from the training 
pair. Thus, the data from Exp. II 
are more in line with the theory, being 
almost exactly as predicted. The 
differences between the two sets of 
data were discussed in terms of the 
possible need for positing asymmetri- 
cal generalization gradients. 


(Manuscript received April 21, 1951) 


iS. 


14. 


REFERENCES 


- Dasure.t, J. F. Fundamentals of general 


psychology. Cambridge, Mass.: Hough- 
ton Mifflin, 1949. 


. Grice, G. R., & Sattz, E. The generaliza- 


tion of an instrumental response to stimuli 
varying in the size dimension. /. exp. 
Psychol., 1950, 40, 702-708. 


. Gutusen, H. Studies of transfer of re- 


sponse: I. Relative versus absolute fac- 
tors in the discrimination of size by the 
white rat. J. genet. Psychol., 1932, 40, 
37-51. 


. Guiirxsen, H., & WotFte, D. L. A theory 


of learning and transfer: II. Psycho- 
metrika, 1938, 3, 225-251. 


- Hess, D. O. The organization of behavior. 


New York: John Wiley, 1949. 


. Hircarp, E.R. Theories of learning. New 


York: Appleton-Century-Crofts, 1948. 


. Hizearp, E. R., & Marquis, D.G. Condi- 


tioning and learning. New York: D. 
Appleton-Century, 1940. 


. Hutt, C. L. Principles of behavior. New 


York: D. Appleton-Century, 1943. 


. Huut, C. L. Stimulus intensity dynamism 


(V) and stimulus generalization. Psy- 
chol. Rev., 1949, 56, 67-76. 


. Jones, F. N. The stepwise phenomenon in 


rats. J. comp. Psychol., 1939, 27, 39-44. 


. Keurer, F. S., & Scuoenretp, W. N. 


Principles of psychology. New York: 
Appleton-Century-Crofts, 1950. 


. Kenpier, T. S. An experimental investi- 


gation of transposition as a function of 
the difference between training and test 
stimuli. J. exp. Psychol., 1950, 40, 552- 
562. 

Kutver, H. Behavior mechanisms in mon- 
keys. Chicago: Univ. Chicago Press, 
1933. 

Kuenne, M. K. Experimental investiga- 





15. 


16. 


17. 


18. 





Oo => CF YE ve OUmMNU! 


Cw 








TRANSPOSITION GRADIENT 87 


tion of the relation of language to trans- 
position behavior in young children. /. 
exp. Psychol., 1946, 36, 471-490. 

15. Munn, N. L. Psychology. Cambridge, 
Mass.: Houghton Mifflin, 1946. 

16. Munn, N. L. Handbook of psychological 
research on the rat. Cambridge, Mass.: 
Houghton Mifflin, 1950. 

17. Spence, K. W. The nature of discrimina- 
tion learning in animals. Psychol. Reo., 
1936, 43, 427-449. 

18. Spence, K.W. The differential response in 
animals to stimuli varying within a single 


dimension. Psychol. Rev., 1937, 44, 
430-444. 

19. Spence, K.W. The failure of transposition 
in size discrimination of chimpanzees. 
Amer, J. Psychol., 1941, 54, 223-229. 

20. Spence, K. W. The basis of solution by 
chimpanzees of the intermediate size 
problem. J. exp. Psychol., 1942, 31, 
257-271. 

21. Zeaman, D., & House, B. J. Response 
latency at zero drive after varying num- 
bers of reinforcements. /. exp. Psychol., 
1950, 40, 570-583. 











ROTE LEARNING AS A FUNCTION OF DISTRIBUTION OF 


PRACTICE AND THE COMPLEXITY OF THE SITUATION! 


DONALD A. RILEY 
University of California 


The purpose of this study is to in- 
vestigate the relationship between 
distribution of practice and the com- 
plexity of the situation in a verbal 
rote learning task, where complexity 
is defined in terms of the number of 
possible alternative responses which 
may be made to each stimulus. 

Although Hovland (10,11) has 
presented evidence that distribution 
of practice is superior to massed prac- 
tice in serial and paired-associate 
learning situations in which the dis- 
tribution intervals are of brief dura- 
tion, investigators using conditions of 
greater complexity have found con- 
tinuous practice superior to rest in- 
tervals of several hours between trials. 
Cook investigated the effects of dis- 
tribution of practice in mazes of vary- 
ing complexity, and found that in all 
situations an interval of 20 to 30 sec. 
was superior to a 4-hr. interval be- 
tween trials, the relative superiority 
of massed practice being greater for 
the situations with the greater num- 
ber of alternatives at each choice 
point. 

Ericksen (6) compared the relative 
efficiency of continuous practice ver- 
sus a brief practice period every other 
day in the learning of a multiple 
U-maze and a McGeoch puzzle box. 
On the basis of studies (3, 5, 9, 22) 
which had found that in rats, massing 
of trials brings about an increase in the 


1 This paper is a portion of a dissertation sub- 
mitted to the graduate school of The Ohio State 
University in partial fulfillment of the require- 
ments for the Ph.D. degree. The author wishes 
to express his appreciation to Drs. A. W. Melton, 
D. D. Wickens, and R. J. Wherry for their advice 
and assistance. 


variability of behavior in a T-maze, 
when two trials a day are given, he 
predicted that the maze problem 
should profit by distribution because 
it is a problem which, in the initial 
stages of the learning process, maxi- 
mizes the fixative aspects of the 
process, while the puzzle problem 
should profit most by massing, be- 
cause it is a problem which maximizes 
the variable aspects of the learning 
process in the initial stages of the prob- 
lem. His prediction was confirmed, 
although no evidence was presented 
to show that variability of response 
was increased as a result of massing. 
That variability of response should 
increase with massing would seem to 
be a necessary consequence of the 
original assumption. Another pre- 
diction which would follow from the 
hypothesis that massing increases 
variability is that in simple situations, 
learning in the early trials would prof- 
it most by distributed practice; 
whereas, in the more complex situ- 
ations, learning in the early trials 
would profit most by massed practice. 
This prediction would follow because 
in more complex situations increasing 
the variability of the response would 
increase the probability of discovering 
the correct response, whereas in 
simpler situations increasing the vari- 
ability of response would increase the 
probability of giving a response other 
than the correct response after the 
correct response had been given. Dis- 
tribution of practice should become 
more beneficial in the later stages of 
learning, where variability would in- 
terfere with the final fixation of the 
correct responses. Cook (1, 2) found 
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massing superior only in the early 
trials in puzzle solving, but in the 
maze studies he found massing supe- 
rior throughout the learning process. 
Other earlier investigators (14, 17) 
found early distribution superior in 
simple situations with both humans 
and rats as Ss. 

In consideration of the wide differ- 
ences in Ss, materials to be learned, 
and methods of training, which are to 
be found in the various studies bearing 
on the problem of distribution of prac- 
tice, it is not surprising that discrep- 
ant results have been obtained. The 
aim of this study is to hold the dis- 
tribution intervals similar to those 
used in the Hovland studies and to 
make the simple condition compara- 
ble to the conventional paired-associ- 
ate learning situation. Complexity 
is to be defined as the extent to which 
the discovery process is involved in 
the learning process, i.e., the greater 
the number of alternatives presented 
to the individual at each choice situ- 
ation, the greater the complexity of 
the situation (18). This dimension 
is varied in a stepwise fashion. 

The experiment being reported 
deals with the following questions: 
(a) What is the relationship between 
distribution of practice and the com- 
plexity of the situation as measured 
by learning and retention scores? 
(b) Is there evidence for greater 
variability of response in the massed 
conditions, and if so, what effect does 
this greater variability have on the 
efficiency of learning in the early and 
later parts of the learning process? 


PROCEDURE AND MATERIALS 


Terminology.—The terms used to identify the 
various conditions will aid in describing the pro- 
cedure. There were six groups of Ss, all of which 
were required to learn the same eight paired- 
associates by the anticipation method. Three of 
these groups learned under massed practice (8 
sec. between trials) and will be referred to as 


M-I, M-II, and M-IV. M-I was the standard 
paired-associate situation, wherein each stimulus 
term was followed by only one response term. 
M-II was the same except that each stimulus 
term was followed by two response terms only 
one of which was correct. In M-IV, each stimu- 
lus term was followed by four response terms, 
only one of which was correct. The correspond- 
ing distributed practice groups (2 min. between 
trials) will be designated D-I, D-II, and D-IV. 

Materials.—The stimulus elements were eight 
of the nonsense symbols used by Gibson (8).? 
The response terms were, depending on the ex- 
perimental group, 1, 2, or 4 nonsense syllables. 
These were presented on a paired-associate 
memory drum in such a manner that the stimu- 
lus first appeared alone for 2 sec., then the stimu- 
lus and the response (or possible responses) ap- 
peared together for 2 sec., then the cycle re- 
peated with a different symbol and different 
syllables. 

All syllables were between 26% and 27% as- 
sociation value.* They were selected to be as 
dissimilar to each other as possible. Of the 
eight correct syllables, none had the same first 
letter, nor the same last letter, and none had two 
letters in common. These were the same syl- 
lables for each group. The symbols and their 
respective syllables appeared in four different 
sequences to eliminate serial effects. In Groups 
M-II and D-II, where each symbol was accom- 
panied by two syllables, each syllable appeared 
in the first position half of the time, each ap- 
peared in the second position half of the time. 
In Groups M-IV and D-IV each of the four syl- 
lables accompanying any symbol appeared in 
each of the four possible positions one-fourth of 
the time. Thus, if the four possible R terms for 
S; first appeared in the order Rie, Rw, Ric, Ria, 
they might next appear in the order Ric, Rie, 
Rua, Rw. 

Subjects —Each of the six groups contained 
20 Ss, selected at random from students in ele- 
mentary psychology having Ohio State Psycho- 
logical Examination scores above percentile 30.‘ 


2 The reason for using nonsense symbols for 
the stimulus terms instead of nonsense syllables 
was that it was hoped that interference due to 
the many conflicting nonsense syllables being 
used in the more complex conditions would be 
somewhat lessened. 

* The syllables were taken from lists of non- 
sense syllables for paired-associate learning 
studies prepared by A. W. Melton. 

‘In a pilot study, 3 out of 30 Ss failed to learn 
the material. They were the only Ss with 
OSPE’s below percentile .30. It was hoped 
that a matching procedure using the OSPE as a 
criterion could be used, but after half of the Ss 
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All Ss were naive to psychological experimenta- 
tion, and did not know the purpose of the experi- 
ment. Five Ss were discarded for failure to 
learn. All had 80 trials and did not appear to be 
approaching the criterion. The slowest Ss to 
learn required 56 trials. The five eliminated Ss 
were distributed as follows: M-IV—2, D-IV—2, 
M-II—1. They were replaced by other Ss. 

Procedure.—The Ss learned the lists to one 
perfect recitation by the method of anticipation 
on the first day, returned the next day at the 
same time, and relearned to the same criterion. 
Learning was preceded by rather elaborate in- 
structions and demonstration of procedure to 
compensate for the lack of a practice period. 
The procedure varied according to distribution 
in that in the distributed groups, 1 min. and 56 
sec. of the 2-min. interval between each trial was 
filled by naming colors from a large board con- 
taining 60 color squares, arranged in ten columns. 
The Ss named the colors in rhythm with a 
metronome which beat once every 2 sec. until 
the Ss’ rhythm was established. Any deviation 
from the 2-sec. rhythm resulted in the metro- 
nome beating again. In the massed condition, 
the 8-sec. interval between trials was unfilled. 
The Ss were required to spell out the syllables, 
and the noncorrection method was used to hold 
the work per trial constant as the situations in- 
creased in the degree of discovery required. 
Evocation of the correct response caused a 
buzzer to sound, indicating correct, whether the 
S anticipated the correct response or merely 
said it after it appeared. This was necessary in 
order to inform Ss in the more complex condi- 
tions when they had located the correct response. 
After S had anticipated all syllables correctly in 
the same trial, he was requested not to think 
about the material until he returned the next 
day. 


RESULTS 


As may be seen from Table 1, the 
D groups took fewer trials to reach 
the criterion in all three conditions. 
The over-all F (4.10) between the 
massed and the distributed conditions 
is significant at the 5% level of con- 
fidence, and the consistency of the dif- 
ferences between the means would 
support the assumption that the mag- 
nitude of the F ratio is due, in part, 
to the differences between these 


had been run in the experiment, no significant 
correlation between OSPE’s and learning scores 
emerged, so the selection on a random basis was 
continued. 
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TABLE 1 


Mean Numser or TRIALS To ONE 
Perrect RECITATION 








Massed Distributed 
Choices 





Mean SD Mean SD 


I 26.85 9.45 21.05 7.51 
II 37.45 13.72 | 34.00 8.85 
IV 40.70 6.32 | 39.40 7.99 




















means. However, as Bartlett’s test 
for homogeneity of variance yields a 
x? of 21.37 (p < .01) and as the group 
means and group variances do not 
covary in any systematic way, no 
statement concerning the probability 
of such differences occurring can be 
made. The size of the differences be- 
tween Groups M-I and D-I, and be- 
tween these groups and the other 
groups, indicates the possibility that 
a large part of the variance is attribut- 
able to these differences. The use of 
t tests between the groups would, if 
legitimate, support this interpretation, 
but the fact that the interaction vari- 
ance does not approach significance 
makes this interpretation tenuous at 
best. 

Figure 1 indicates for each group 
the mean number of trials required to 
anticipate correctly one nonsense syl- 
lable, to anticipate two in the same 
trial, to anticipate three in the same 
trial, and so on until all eight nonsense 
syllables have been anticipated in the 
sametrial. Examination of the figure 
indicates that the trends of M-I and 
D-I are divergent, the trends of M-II 
and D-II are approximately parallel, 
and that the trends of M-IV and 
D-IV at first diverge, then converge. 
This is taken to indicate that as there 
is an increase in the degree of dis- 
covery required, the effectiveness of 
the distribution of practice appears 
relatively earlier in the learning proc- 
ess. This seems to be in conflict with 
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the hypothesis that in the early stages 
of learning, massing of practice should 
benefit those situations requiring a 
greater degree of discovery. 

In order to analyze more precisely 
the data with respect to the location 
of the beneficial effects of distribu- 
tion, the data were divided into two 
parts. The first, which will be called 
the discovery period, is, for each S, 
the mean number of trials required to 
anticipate first each correct nonsense 
syllable. The second, which will be 
called the fixation period, is obtained 
for each S by taking the number of 
the trial following the last failure of 
anticipation for each of the correct 
syllables, subtracting from this value 
the corresponding discovery value, 
and averaging the eight values. This 
will be recognized as an averaging of 
the range of oscillations at the thresh- 
old of recall used by Hull et al. (13), 
which is presumably an index of the 
interference between the items in a 
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Fic. 1. Mean number of trials to various num- 
bers of correct anticipations within a trial 


TABLE 2 


Mean Numser or TRIALS To THE First 
Correct ANTICIPATION or Eacu SYLLABLE 





Massed Distributed 
Choices 





Mean SD Mean SD 





I 13.55 4.63 10.72 
II 23.38 9.41 20.95 
IV 26.53 5.04 23.84 


wnt 
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task (at least in serial learning situ- 
ations). 

Table 2, which shows the results of 
the analysis of the discovery period, 
indicates a uniform statistically sig- 
nificant superiority of the distributed 
conditions. The F of al] conditions 
was 21.80 (p < .001); for distribution 
F was 5.8 (p < .05); and for degree of 
discovery required F was 51.50 (p < 
.001). A series of steps was taken 
in an effort to determine whether the 
superiority of the distributed groups 
was related to differences in the vari- 
ability of response. The data in the 
II and IV conditions were analyzed 
with respect to the number of response 
shifts from trial to trial, response 
shifts being defined as any change in 
the response to a stimulus from one 
trial to the next, regardless of the 
degree of the change. The I condi- 
tions were not included because the 
response shifts were negligible. Table 
3 indicates that the hypothesis of no 
difference between M and D condi- 
tions cannot be rejected (F = .01), 
and that no evidence for massed prac- 
tice as a determiner of variability has 
been found by this procedure.® 

As a final step in the analysis of the 
discovery scores, the number of cor- 

5 The possibility that the variability may have 
operated in some differential way was consid- 
ered, and response shifts away from the correct 
responses were counted, but the over-all F had a 


p > .05 and no consistent trends were in evi- 
dence. 
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TABLE 3 


Mean Response SHIFTs PER SYLLABLE PER 
Triat AND Mean Correct RESPONSES 
PER Exposure TO First CorrEcT 
ANTICIPATION FoR Eacu SYLLABLE 








Massed Distributed 





Choices} R Shifts/ |Correct R/| R Shifts/ |Correct R/ 
Syl./Trial | Exposure | Syl./Trial | Exposure 





Mean} SD |Mean| SD | Mean| SD |Mean| SD 





II -26 | 068] .67 | .091| .31 | .069| .63 | .220 
IV -54 | .069] .40 | .087/ .53 | .081) .39 | .061 





























rect responses made during the dis- 
covery period was divided by the num- 
ber of trials in the discovery period. 
These data, also summarized in Table 
3, revealed no difference between the 
means of the massed and distributed 
conditions, analysis of variance yield- 
ing an F of 2.24. 

The data from the fixation period 
are shown in Table 4. The analysis 
of variance on all groups gave an F of 
2.28, which is short of the 5% level of 
confidence, but is of considerable 
interest as it indicates the possible 
locus of the superiority of massed 
practice over distributed practice. 
Inspection of the means shows that in 
the M conditions increasing complex- 
ity required fewer fixation trials, while 
in the D conditions increasing com- 
plexity required more fixation trials. 


TABLE 4 


Mean Numser or TRIALS FROM THE First 
Correct ANTICIPATION oF Eacu SYLLABLE 
TO THE TRIAL FoLLtowinc THE Last 
Fatture to Anticipate Eacu 
SYLLABLE 





Massed Distributed 
Choices 





Mean SD Mean SD 





I 6.84 3.75 4.05 3.59 
II 6.61 4.86 5.72 3.76 
IV 5.84 2.82 6.17 3.74 
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On the basis of these findings, we 
may conclude that the general superi- 
ority of distributed practice is in large 
part due to effects operating early in 
the learning process. As practice 
continues, these beneficial effects of 
distribution tend to disappear, and 
it is possible that in the more complex 
conditions there is actually a reversal, 
the massed conditions becoming more 
beneficial. 

Retention after 24 hrs. was meas- 
ured both by recall and by relearning. 
The results are shown in Table 5. 


TABLE 5 


Mean ANTICIPATIONS ON First RETENTION 
Tria, AND Mean TrIAts To RELEARN 
to One Perrect RECITATION 








Correct Responses on Mean Trials to 
First Relearning Trial Relearn 





Choices Distrib- Distrib- 
Massed uted Massed uted 





Mean} SD |Mean| SD | Mean| SD |Mean| SD 





I 4.05 | 1.53} 5.10 | 1.22) 4.6 | 3.0| 2.6 | 2.7 
II 3.35 | 1.71] 4.15 | 1.94) 5.4 | 3.4] 4.2 | 2.6 
IV 3.45 | 1.99] 4.60 | 1.83) 6.0 | 3.7 | 4.4 | 3.6 





























They fail to disclose significant vari- 
ation either in recall or in relearning, 
although in all cases the means favor 
the distributed conditions. 


Discussion 


In general, the results of the experi- 
ment do not support previous studies 
which have found massing superior in 
situations requiring relatively large 
degrees of discovery. There are three 
obviously important differences be- 
tween this study and the studies of 
Cook (1, 2) and of Ericksen (6) which 
make it impossible to state the one 
reason for this discrepancy; namely, 
(a) differences in the degree of dis- 
tribution, (b) some differences in the 
degree of discovery (very complex 
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mazes or puzzle situations presum- 
ably requiring a greater degree of dis- 
covery than the complex conditions 
of this study), and (c) differences in 
the nature of the task. Without con- 
siderable investigation, it would be 
impossible to state the optimal de- 
gree of distribution for any given task, 
but it is possible that some of the ex- 
periments referred to found massing 
superior for the simple reason that 
the distributed practice period was too 
distributed, with the negative effects 
of forgetting the required responses 
being greater than the positive effects 
of distribution. Also, extrapolation 
of the trends of the data in Table 1 
and Table 4 indicates the possibility 
that more complex conditions would 
result in the superiority of massed 
practice. 


The factor or factors influencing the superi- 
ority of the D conditions occurred mainly in the 
early trials, a finding not in agreement with the 
assumption of the Ericksen study that early mass- 
ing would aid in tasks which require relatively 
large amounts of variable behavior. Further- 
more, there was no indication of greater vari- 
ability in the M conditions than in the D 
conditions. Although increased variability as a 
result of massing does occur in some rigidly cir- 
cumscribed conditions (4,22), the phenomenon 
has yet to be demonstrated in situations requir- 
ing complex patterns of response which change 
from trial to trial (21). The additional finding 
that the M and D conditions do not differ sig- 
nificantly in the number of correct responses per 
trial in the discovery period seems to indicate 
that the superiority of the D groups is due to 
some factor other than an early identification 
and practicing of the correct response. These 
findings indicate that the prediction of increased 
variability and beneficial effects of distribution 
as a function of the greater accumulation of in- 
hibitory effects with greater massing, as pro- 
posed by Hull (12) and as elaborated by Solomon 
(19), does not extend to the present situation. 
Although it is recognized that accumulated in- 
hibitory effects might account for the slower 
speed of anticipation in the M conditions, the 
fact that the Ss in the D conditions were con- 
tinuously responding during rest intervals makes 
this interpretation dubious in the present case. 

Another hypothesis which has been suggested 
to account for certain beneficial effects of dis- 


tribution of practice is the interference hypothe- 
sis advanced by McGeoch (16), which assumes 
that weak responses are forgotten more rapidly 
than strong responses as a result of the interfer- 
ence of other competing responses. As the rest 
period in distribution creates an opportunity for 
additional responses to compete with and thus 
eliminate the weak responses, there should be 
more correct responses per trial in the D condi- 
tions. That this was not true in the discovery 
period indicates that the differences in perform- 
ances lay in the stage of learning at which antici- 
pation began, and this difference is not due to 
a difference in the speed with which the correct 
response is learned. To state this in a more 
general way, the superiority of the D groups is 
revealed in recall but not in recognition. 

Although the precise factors associated with 
this result are not indicated in the present study, 
Gibson’s (7) theory of verbal learning would sug- 
gest the possibility that the beneficial effects of 
distribution would arise either in learning to dis- 
criminate the stimuli or in the process of associ- 
ating each S-term with its R-term. Either 
process would provide an opportunity for differ- 
ential forgetting to occur. 

One aspect of the data from the discovery 
scores which requires consideration is the lack of 
apparent change in the superiority of distributed 
practice with increases in the degree of discovery 
required. According to the original hypothesis, 
there should have been an increase in the effec- 
tiveness of massing. According to the interfer- 
ing-response hypothesis, if increased discovery 
requirements bring increased interference, then 
distribution of practice should show an increas- 
ing superiority. Neither of these assumptions 
is supported. Apparently, increasing the num- 
ber of choices from one to two to four increases 
the difficulty of discovering the correct response, 
but does not appreciably affect the amount of 
intra-list interference. This is also indicated by 
the trends of the fixation scores. McClelland 
(15) has reported similar results and has offered 
an explanation to the effect that generalization 
between response terms decreases rather than 
increases when the choice is between right and 
wrong responses rather than competing correct 
responses. 

Consider the data from the fixation period. 
As was indicated, the groups do not differ sig- 
nificantly, but the trends are of some interest, 
and they are consistent with the above explana- 
tion of the beneficial effects of distribution. In 
the discovery period, the superiority of the dis- 
tributed condition was present in all conditions, 
but this superiority continued in the fixation 
period only in the case of M-I and D-I, these 
groups differing from the other groups in that Ss 
in the I groups had less experience with the stim- 
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ulus terms before they started to anticipate the 
responses. This occurred because there was less 
of a problem in learning the correct responses, 
and anticipations could be made before the stim- 
ulus terms were highly differentiated from each 
other. Thus, some of the task in which dis- 
tribution was beneficial remained to be done in 
the fixation period. In the II and IV conditions, 
the location of the correct responses took pro- 
gressively more trials; consequently more prac- 
tice with the stimulus terms occurred in the dis- 
covery period and there was less opportunity for 
beneficial effects of distribution to occur in the 
fixation period. Another factor which was prob- 
ably operating to lower the effectiveness of dis- 
tribution of practice was fatigue. It should be 
emphasized that the distribution conditions re- 
quired 2.53 min. from the start of one trial to the 
start of the next, whereas the massed conditions 
required .67 min. from the start of one trial to 
the start of the next. The combination of these 
two factors seems sufficient to account for the 
trends during the fixation period. This inter- 
pretation would not account for the results of 
other investigators who found massing superior 
in complex situations, as the rest periods in these 
other studies were not filled, and there is no rea- 
son to assume that fatigue should have been 
operating in these cases. 

The slight, statistically insignificant, but very 
consistent mean superiority of the distributed 
conditions in both recall and relearning does not 
indicate any change in the relationship between 
distribution of practice and degree of discovery 
after 24 hrs. This study does not throw any 
particular light on the problem of the superior- 
ity of distribution of practice in retention, there 
merely being an indication that this phenom- 
enon apparently exists in more complex condi- 
tions as well as in the simplest rote learning 
situations. 


SUMMARY 


1. The purpose of the experiment 
was to investigate the relationship be- 
tween the number of possible re- 
sponses to each stimulus and distribu- 
tion of practice, where the conditions 
of distribution were comparable to a 
number of other recent studies in the 
field of rote learning. Data were 
collected to test the assumption that 
massed practice becomes more effec- 
tive as the number of response choices 
to each stimulus increases, this in- 
creased effectiveness being a function 
of the increased variability resulting 
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from the massing with a consequent 
increased efficiency in locating the 
correct responses. 

2. Three groups of Ss learned, with 
massed practice, to give nonsense syl- 
lable responses to nonsense symbols, 
8 sec. being interpolated between each 
trial. These groups differed in that 
only one syllable for each symbol was 
exposed for Group M-I whereas two 
syllables appeared with each symbol 
for Group M-II and four syllables ap- 
peared with each symbol for Group 
M-IV. In each case only one syllable 
was correct for each symbol, and the 
task of S was io learn to give the cor- 
rect syllables bj the anticipation 
method. Three corresponding groups 
learned with distribu:ed practice (2- 
min. rest between trials, filled by 
reading colors from a_ board). 
Twenty-four hours after learning to a 
criterion of one perfect trial, retention 
was tested by relearning to the same 
criterion. 

3. No evidence for greater vari- 
ability of responses under either con- 
dition was found, nor was there any 
evidence that the correct responses 
were more rapidly located or learned 
more effectively under either condi- 
tion. 

4. The distributed conditions were 
superior to the massed conditions in 
the anticipation of the correct re- 
sponses. This is most strikingly evi- 
denced in the mean number of trials 
to the first correct anticipation for 
each nonsense syllable, but is also 
true in trials to the criterion of mas- 
tery. Retention after 24 hrs. shows 
trends indicating the same consistent 
superiority of the distributed condi- 
tions. 

5. The major beneficial effect of 
the distribution of practice comes 
early in the learning process. There 
are trends which are not statistically 
significant that indicate a possibility 
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that massing may be beneficial late in paired-associate learning. J. exp. Psy- 
learning when there are a number of chol., 1939, 25, 622-633. abi si 
shnsenation vemeanse 1. Hovianp, C. I. Experimental studies in 
P , rote learning theory. VIII. Distributed 
° ‘ , practice of paired associates with varying 
(Manuscript received April 16, 1951) rates of presentation. J. exp. Prychol., 
1949, 39, 714-718. 
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THE DECLINE OF PITCH DISCRIMINATION WITH TIME 


J. DONALD HARRIS 
U. S. Naval Medical Research Laboratory 


Interest in the matter of pitch 
discrimination has been increasing 
sharply in very recent years, with 
developments in our knowledge of how 
the auditory nerve responds (and even 
how its individual fibres respond) to 
pure tone stimulation, and with 
developments in various techniques 
for studying the sharpness of tuning 
of the basilar membrane. 

In this paper are reported experi- 
ments in which O hears two tones 
separated in time, and is required to 
judge the second “higher” or “lower.” 
The question is whether the judg- 
ment on the second tone is always 
referred to some relatively stable 
standard resulting from the first tone, 
even one presented many hours previ- 
ously, or whether pitch discrimination 
demands close contiguity of the two 
stimuli in time so that interpretation 
can properly utilize such neural con- 
cepts as delay circuits, inhibition, and 
summation. If discrimination does 
not decline during a period of time 
longer than the immediate neural 
events brought on by the first tone, 
then those immediate neural events 
are inappropriate as an explanation. 

The effect of time on judgments of 
pitch has been the subject of several 
papers. The earlier ones were inter- 
ested largely in a study of the “mem- 
ory-image,” while the later ones were 
interested more in the changes in the 
time error than in the changes in 
sensitivity. 


The most illuminating study for the present 
purpose is, oddly enough, the first one, in 1886, 
by Wolfe (9). He used impure tones of 1-sec. 
duration as standard stimuli, and comparison 
stimuli differing by 0, 4, 8, and 12 cps. The 
inter-stimulus intervals ranged up to 180 sec., 
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the larger part of the data being collected at 60 
sec. and less. It is important to note that Wolfe 
guarded against forming a sense of absolute 
pitch, by using up to 11 different frequencies 
interchangeably as standards. Later workers 
generally have disregarded this vital factor. 
Wolfe found no clear difference among the 
intervals from 1 to 5 sec., but subsequently a 
very slow decline in sensitivity occurred up to 
120 sec., the limit of the experiment. Wolfe 
fitted the curves for three Os by the equation: 


Per cent correct = a. +¢f, 
log t 
where f = frequency, t = time, and & and ¢ are 
constants. 

Using the same instrument to produce clangs, 
Angell and Harwood (2) duplicated in 1899 
much of Wolfe’s experiment. Their results 
confirmed Wolfe’s study in general although 
there were distinct differences in initial ability, 
and in resistance to decline of sensitivity, 
between the two groups of Os. A repetition of 
Angell and Harwood, with the same instrument, 
was done by Whipple (8), with essentially the 
same results. 

In 1914 Anderson (1) studied inter-stimulus 
intervals of 7, 4, 3, 4, 1, 2, 3, and 4 sec. with 
a view to determining the most efficient one for 
discrimination. He concluded that “the in- 
tervals tested are practically equally favorable, 
with possibly a slight tendency in favor of the 
shorter.” 

Irwin (5) in 1937 used intervals of 1, 10, 30, 
and 60 sec., with carefully controlled 256-cps 
tuning forks with acoustic filters. He reported 
changes of differential threshold with respect to 
the l-sec. interval, and found an average decline 
from 1 to 10 sec. of .43 cps. At 30 and 60 sec., 
the data revealed no further regular decline. 

Koester (6) directly attacked the problem in 
1945 with quite adequate apparatus and 
technique, but covered such a range of stimuli 
that on any particular point his data are rather 
scanty. His results with four practiced Os 
with use of the constant method showed no 
difference in threshold between the zero and 1- 
sec. intervals, and an average increase in the 
size of the variable error of about .8 cps from 
1 to 10 sec. His results with 14 unpracticed Os 
at the intervals 1, 3, 6, and 9 sec. were quite 
inconclusive. Even at the shortest interval 
and at the easiest level, 12 cps, this group was not 
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performing better than chance so that it was not 
possible to determine deterioration as a function 
of time. 

For intervals longer than 10 sec., Koester 
chose for each of three practiced Os a difference 
detected only about 80% of the time at zero 
interval, and presented the two stimuli in a 
random sequence, giving a tone every 5, 15, or 
47 sec. in three different series. No significant 
difference occurred up to 15 sec., but deteriora- 
tion was evident at 47 sec. 

Postman (7) is the latest to investigate the 
inter-stimulus interval for pitch, seeking to 
measure the time error, if any. He obtained 
average DL’s for nine Os at 0, 1, 2, 4, and 6 sec., 
at each of seven frequencies. The variability 
shown in his results makes it difficult to see any 
influence of interval. At only three frequencies 
was the 0-sec. interval the most efficient, and at 
only two frequencies was the 6-sec. interval the 
least efficient. 


APPARATUS AND PROCEDURE 


The outputs of two sine-wave oscillators 
were led to the two channels of a “clickless” 
electronic switch, and then to two attenuators, 
a mixing circuit, and a group of 12 matched 
phones. One oscillator (General Radio Model 
913C) had been modified so that it could be 
varied in half-cycle steps. The timing control 
of the switch passed one tone for 1.4 sec., allowed 
an interval of silence from .1 to 25 sec., then 
passed a second tone of 1.4 sec. The timing was 
performed by a Potter Instrument Company 
electronic device, accurate to a millisecond. 
Small frequency steps on the variable oscillator 
were calibrated with a Conn chromatic stro- 
boscope. Zero differences in frequency between 
the oscillators were obtained by use of the null 
point meter provided on one of the oscillators 
(General Radio Model 617C). 

The procedure was a modification of the 
method of constant stimulus differences, using 
two categories, “higher” and “lower.” The 
standard stimulus, at 1000 cps, was always 
given first. Comparison stimuli were chosen 
so that at least ten points were determined 
within the useful range of the psychometric 
function. The constants of the psychometric 
function for each O were determined using the 
Miiller-Urban weights. All functions were 
based on at least 360 judgments, i.e., 36 judg- 
ments per comparison stimulus. 

A 40-item practice series was given to each O. 
Those who scored 11 or more items incorrectly 
(about 5% of the initial group of Os) were not 
used further, since they were so poor in dis- 
crimination that no threshold could have been 
computed for them when using the particular 


comparison stimuli chosen. In the experiment 
proper, systematic practice effects were mini- 
mized by randomizing the  inter-stimulus 
intervals. Since Os were not available for an 
unlimited time, it was not possible to use each O 
to explore all the inter-stimulus intervals 
desirable. It was necessary to rely upon large 
numbers. A total of 213 young men with 
normal hearing acuity was used, in groups of 
12 or fewer. 

The effect on pitch discrimination of elapsed 
time between stimuli has been shown in a 
previous report (3) to depend to a considerable 
extent upon whether the standard stimulus is 
always of the same frequency. When it is, O 
can quickly build up from the preceding stimuli 
a subjective standard upon which he bases his 
judgment of succeeding stimuli. This sub- 
jective standard remains stable for many 
seconds. Consequently, discrimination with a 
single standard stimulus is relatively little 
affected by the time between standard and 
comparison stimuli. 

If, on the other hand, the standard stimulus 
is changed in frequency for every judgment, 
then the range of the standard and variable 
stimuli is considerably greater, the preceding 
sequence of stimuli furnishes orily a relatively 
coarse anchor for judging, and O is forced to 
consider only the two particular stimuli of a 
comparison pair. Under these conditions, it is 
possible to show that time has a measurable 
effect on discrimination. 

Accordingly, in this experiment two pro- 
cedures were used, one with a fixed standard and 
one with a standard varying from 950 to 1050 
cps, in 5-cps steps. It has been shown by using 
the method of single stimuli that the latter 
procedure provides a subjective standard so 
coarse that the DL is greater than 30 cps (4). 


RESULTs AND Discussion 


Because of the greater practical 
importance of pitch discrimination at 
inter-stimulus intervals of a very few 
seconds and less, these times were 
sampled especially carefully. Table 1 
contains data from .3 to 3.5 sec. No 
difference was found within this 
period when a fixed standard was 
employed. This corroborates the 
data of Anderson (1), who found no 
reliable difference on the average or 
for any of four Os at the intervals 
vs, $; i, 3, and | sec. 


For the longer intervals it was 
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TABLE 1 


Mean DirFERENTIAL PitcuH THRESHOLDS 
in CPS wits Various Brier INTERVALS 
BETWEEN Two Tones 














Inter-Stimulus Interval in Seconds 
Group} N 
» 5 1 2 3 3.5 
A 35 6.1 | 6.3 | 5.9 
B 68 | 5.5 4.6 
s Z2 1 Sek 4.75 
D 59 | 5.6 5.0 
E 88 4.75 4.7 
F 87 4.65 | 4.9 
G 35 4.9 4.7 


























possible to use the same Os with all of 
the intervals. One group of 32 Os 
was tested with intervals of 3, 7, 15, 
and 25 sec. with a fixed standard, 
another group of twelve Os was 
tested with intervals of .1, 1, 3, 7, and 
15 sec. with the roving standard. 
The data from these experiments are 
summarized in Table 2 and Fig. 1. 
The data from the tests with a fixed 
standard corroborate in general Koe- 
ster’s (6) results at 0, 1, 5, and 10 
sec. He found an average increase of 
.8 cps after 10 sec.; an interpolation at 
10 sec. in our data gives an increase in 
the DL of .65 cps. Differences in the 
methods employed or individual dif- 
ferences in Os could well account for 


TABLE 2 


Mean DirrerenTIAL Pitcno THRESHOLDS IN 
CPS witrn Various Retativety Lonc 
INTERVALS BETWEEN Two Tones 

















Type of Standard Tone 
Inter- 
oa Fixed (N =32) Roving (N =12) 
(in Sec.) 
Mean SD Mean SD 
0.1 4.2 1.51 
1 4.3 1.20 
3 3.9 1.15 5.1 1.36 
7 4.1 1.06 5.4 1.22 
15 4.7 1.43 7.9 1.60 
25 6.9 1.81 

















the disparity. Unfortunately, the 
earlier writers who used a roving 
standard did not report their results 
in terms of a differential threshold, so 
that further exact comparisons with 
previous work cannot be made. 

When the two curves of Fig. 1 are 
compared, a considerable difference 
in effect of elapsed time is clear. 
With the fixed standard, we appear 
to be studying the relatively slow 
deterioration of the subjective stand- 
ard, which is so stable that discrimina- 
tion has declined by less than 1 cps 
after 15sec. On the other hand, with 
the roving standard we are studying 
the relatively rapid deterioration of 
the after-effects of the standard 
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Fic. 1. Pitch sensitivity at 1000 cps with 
relatively longer inter-stimulus intervals 


stimulus alone. This process results 
in the same decline of discrimination 
after only 4 sec. 

Of especial interest is the compari- 
son between intervals .1 and 1 sec. 
with the roving standard. The longer 
interval results in a nonsignificant 
deterioration of only .1 cps. Evi- 
dently pitch discrimination may be 
based upon neural events which are of 
relatively long duration. 


SUMMARY AND CONCLUSIONS 


Previous experiments on the effect 
of elapsed time between the standard 
and comparison stimuli on pitch 
discrimination have suffered from 
either of two defects: (a) insufficient 
information was obtained to permit 
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the calculation of differential thresh- 
olds; or (b) a simple fixed standard 
stimulus was used so that a stable 
subjective anchor could be built up, 
which permitted O to maintain dis- 
crimination over a long inter-stimulus 
interval. 

The present experiment showed 
that with a fixed standard stimulus, 
no decline in discrimination occurs 
up to 3.5 sec., a decline of .8 cps 
occurs after 15 sec., and a decline of 
3.0 cps occurs after 25 sec. With a 
roving standard stimulus, on the other 
hand, no decline occurs from .1 to 1.0 
sec., but delays of 3, 7, and 15 sec. 
result in changes of .85, 1.1, and 3.7 
cps respectively. It is clear that 
there occurs an appreciably more 
rapid deterioration of the after-effects 
of a single stimulus than of the sub- 
jective anchor built up from repeated 
stimuli close together in frequency. 

Even with a roving standard, which 
allows for the formation of only a very 
coarse subjective anchor, and forces 
O to refer his judgment more com- 
pletely to the immediately preceding 
standard stimulus, no real loss of 
pitch discrimination occurs between 
the inter-stimulus intervals of .1 and 
1 sec. It may be argued that this 
fact suggests an explanation of pitch 
discrimination on the basis of some 
other neural event than a delay 
circuit, inhibition, or summation. It 
seems clear that each single stimulus 


contributes to a subjective standard 
at a level over and above the neural 
after-effects on the basis of which a 
single comparison is made. 


(Manuscript received April 13, 1951) 
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EFFECTS OF FAILURE STRESS UPON 
SKILLED PERFORMANCE 


RICHARD S. LAZARUS AND CHARLES W. ERIKSEN 
The Johns Hopkins University 


The way in which people perform 
under conditions of psychological 
stress is of great interest to psycholo- 
gists. However, there has been rela- 
tively little experimental work on this 
topic. Not only does this general 
problem have important applied im- 
plications for military psychology, 
industry, and education, but it is of 
considerable theoretical importance. 
Stress situations are useful for the 
study of various aspects of human 
learning, personality, and the emo- 
tional processes. 

The majority of interest in the 
field seems to have centered around 
emotional reactions to stress and the 
physiological consequences of stress 
conditions. Much of this work has 
been observational in nature. While 
the physiological and behavioral man- 
ifestations of stress reactions are im- 
portant, it is at least equally import- 
ant to study what happens to a 
person’s performance under stress. 
There has been a tendency to assume 
that, under stress, performance will 
either deteriorate or remain un- 
changed. In fact, most of the ex- 
periments that have been done along 
these lines have produced situations 
which, if performance was studied, 
resulted in losses in effectiveness (2, 
11, 12, 15, 16, 17). We believe that, 
if conditions allow it, performance 
should improve for some Ss and be 
unaffected or impaired in others. 

The study of the effects of stress 
upon skilled performance offers some 
rather difficult methodological prob- 
lems. One of the biggest stumbling 
blocks to effective research is the dif- 


ficulty of producing experimentally 
a realistic threat. One possible cri- 
terion of having stressed Ss is an 
operational one, that is, producing 
some change in performance which 
can only be attributed to the stress 
condition itself. Many Es have sim- 
ply used: situations which were as- 
sumed to be stressful. The most 
frequent techniques have been: (a) 
making it impossible for Ss to solve 
problems which they are motivated to 
solve (1, 9, 12, 13, 14); (b) implying 
failure to finish in the allotted time 
(6); (c) giving Ss false failure scores 
(4, 6); and (d) a miscellaneous group 
of studies involving distractions (11), 
deprivation of desired objects (3), 
threat of electric shock (16), dispar- 
agement by E (10), and excess super- 
vision and control of children’s play 
(5). 

Large gaps exist in our knowledge 
about stress effects. We know very 
little about the importance of the type 
of task. We know little about the 
importance of when in the learning 
and performance of a task the stress is 
imposed and under what conditions of 
stress S must function. Nor do we 
know much about what components 
of performance are affected and in 
what way. We recognize that there 
are great individual differences in re- 
sponse to stress, but we do not under- 
stand the nature of these differences 
or the probable interactions between 
the characteristics of Ss and the con- 
ditions of stress. 

In the present experiment we have 


1The authors are endebted to Dr. Jack R. 


Strange for his assistance in collecting the data. 
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attempted to demonstrate that a 
particular situation is stressful in 
terms of our operational criterion and 
to analyze the effects upon the speed 
and accuracy of performance. A 
later article will present correlations 
of performance under stress with 
various personality measures such as 
the Rorschach test. 


PROCEDURE 


Our purpose was to produce a situation which 
would represent a real threat to a group of college 
students and which would allow opportunity to 
measure the effects of this threat upon the speedy 


accuracy, and total efficiency of performance. 
Two-gosupe-were' THUS Une consisted of male 


sophomore engineering majors at The Johns 
Hopkins University. The other was a begin- 
ning mixed class in psychology at Western Mary- 
land College? In the analysis of the data, both 
groups were combined since their results were 
similar. The total number of cases included 115 
experimental Ss and 73 controls. They were 
tested during regular class hours. 

The Ss were told that they were participating 
in an evaluation program, the results of which 
would be placed on their records. It was em- 
phasized that the intelligence test which was to 
be administered was an excellent predictor of 
college success. The Ss were informed that it 
was most important for them to do well since 
their performance might later serve as informa- 
tion for their faculty concerning their status in 
school. 

Both the Western Maryland group and The 
Johns Hopkins group were administered the 
following procedure: 

After the introductory motivating remarks, 
an extended version of Form 1 of the Wechsler- 
Bellevue digit-symbol subtest was given for a 
3-min. time period. During the succeeding 1.5 
hr. the Group Rorschach test was administered. 
During this period two Es removed the com- 
pleted digit-symbol tests, presumably for cor- 
rection. 

When the group had finished the Rorschach 
test, E read off a list of names with the instruc- 
tions that they report to another room. This 
represented the control group. The experi- 
mental and control groups were actually so 


2 The authors are grateful to Professor Paul 
Hessemer of The Johns Hopkins University and 
to Dr. Olive R. Russell of Western Maryland 
College for their kind cooperation in this experi- 
ment. 


divided that they were matched with respect to 
their college grade point averages. 

The experimental group was then told in the 
most severe manner that they had done very 
poorly on the original digit-symbol test. The 
seriousness of this failure was reaffirmed and it 
was suggested that many of them had apparently 
failed to appreciate the importance to them of 
this testing. They were told that another ad- 
ministration would give greater certainty that 
their true ability had been measured and that 
they would be given one more chance. It was 
explained that the other group had done satis- 
factorily on the test. 

The experimental Ss were then given Form 2 
of the digit-symbol test in a lengthened version, 
with a 3-min. time limit. At four intervals 
during the testing, E called out false norms which 
were beyond the capacities of the students to 
achieve. Each line of the test had been lettered 
so that E simply said loudly, “You should have 
finished line B,” then later, “You should have 
finished line E,” and so forth. Any protest or 
comment was met with a sharp, “Quiet.” 

The Ss in the control group were told that 
they had performed well on the digit-symbol 
test but that it was important to be sure that 
their true ability had been measured. They 
were informed that one more measurement would 
be made. They were told that their final score 
would be the average of the two testings, and 
were urged to improve their performance if 
possible. The attitude of E was friendly 
and encouraging, yet reasonably business-like. 
Three minutes were allowed for the test, and as 
in the case of the experimental group, four 
interruptions were made during the test with the 
presentations of false norms. However, these 
norms were well within the level of attained per- 
formance of each S so that this group was sup- 
ported rather than threatened throughout the 
test. Moreover, because of the interruptions 
the control Ss had the same amount of distrac- 
tion as the experimental Ss. 

At the end of the experiment, both groups 
were told the real nature of the experiment and 
allowed to give vent to feelings of relief. The 
purpose of the experiment and its implications 
were discussed with them. 


RESULTS 


Table 1 summarizes the total scores 
of both the experimental and control 
groups on Test 1 and Test 2, These 
data make it clear that the essential 
effect of the stress condition on total 
performance was to greatly increase 
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TABLE 1 


MEANs AND VARIANCES oF Gross Scores 
on Test 1 anp Test 2 





























Experimental Control 
Mean SD Mean SD 
Test 1 111.4 416.2 113.1 436.8 
Test 2 109.4 1096.1 109.4 219.0 
t 70 2.5 
F 2.6 2.0 
? 48 <1 | <02 | <.01 
the variance. The experimental 


group variance on Test 2 (the severe 
stress condition) was significantly 
higher than on Test 1. The control 
group, on the other hand, significantly 
decreased in variance on Test 2 when, 
for these Ss, the stressfulness of the 
situation was reduced. Both groups 
decreased slightly in mean score. The 
experimental group’s relative loss 
was less than that of the control 
group, but was too small to be con- 
sidered reliable. 

In Table 2 are presented the differ- 
ences in number of errors and speed 
between Tests 1 and 2. The experi- 
mental group shows an increase in 
errors while the control group shows a 
decrease, and this difference in the 
change in errors is statistically sig- 
nificant. 


TABLE 2 


DirFERENCES IN Error AND SPEED 
Scores on Test 1 anp Test 2 

















Error Scores Speed Scores 
Group 
' Mean SD Mean SD 
Experimental | +.643 | 4.8 —1.29 | 204.5 
Oy = 115) 
Control —.596 | 4.0 —3.56 | 112.4 
(N = 73) 
t 39 1.3 
F 1.2 1.73 
? <.001 | >.05 20| <.02 
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Both groups show a slight decre- 
ment in speed on Test 2, and the con- 
trol group’s loss is slightly greater 
than the loss for the experimental 
group. This picture tends to obscure 
the true state of affairs, however. 
Those Ss who made many more errors 
on the second testing had a tendency 
to increase in speed. This compen- 
sated for the greater number of errors, 
and in fact, the increase in speed oc- 
curred at the expense of accuracy. 
Speed accounts for 92% of the vari- 
ance in total scores. This means that 
the faster S worked, the higher score 
he made. 

As a consequence, the results on 
speed parallel those on total scores. 
While both groups show much the 
same picture with respect to mean 
change in speed, the variability of the 
two groups is significantly different. 

The effects of stress upon perform- 
ance are also shown by the correlations 
between scores on Tests 1 and 2 for 
the experimental and control groups. 
These are shown in Table 3. For 
total scores and speed the correlations 
for the control group are significantly 
higher than for the experimental 
group. Scores on Test 2 are more 
accurately predicted from Test 1 
when stress conditions are not intro- 
duced. 

All these results indicate that the 
effect of stress upon performance 


TABLE 3 


CoRRELATIONS BETWEEN First anp SECOND 
Test ror Totat Scores, Error 
Scores, AND SPEED Scores 











Experimental} Control 
Scores Group Group id p 
(N = 115) | (N = 73) 
Total 645 901 4.65 | .0003 
Errors .183 — .094 1.78 | .085 
Speed 687 870 | 3.12 | .004 

















* The significance of the differences between r’s is 
evaluated via Fisher's s transformation. 
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TABLE 4 


DIFFERENCES BETWEEN STUDENTS WITH HIGH 
anp Low Grape Point AVERAGES ON 
Tuetr PERFORMANCE UNDER STRESS 








Grade Point 
Average Signifi- 
cance 


Test 


Measure 





High Low 
(N =24)| (N =26) 





Mean Stress Score* | + 6.5 —2.4 #=2.97| <.01 
Variance Estimate 54.8 | 174.8 | F=3.19| <.01 


* See text. 

















varies with the individual. The per- 
formance of some Ss under stress is 
little affected whereas other Ss show 
large improvement or decrement. 
Since it was possible that the type of 
stress effect S showed might be re- 
lated to his grade point average, stress 
scores were obtained for each S in the 
experimental group. The stress score 
was obtained by first subtracting the 
total score on Test 2 from the score on 
Test 1. This difference score was 
highly correlated with Test 1 perform- 
ance (r = —.69 for controls and—.49 
for experimentals). To correct for 
this correlation, the deviation of each 
difference score from the value that 
would be predicted by the Test 1 
score was determined. The data 
from the control group were used in 
obtaining the prediction equation. 
The result is a score which has been 
corrected for Test 1 performance and 
for which positive values indicate 
improved performance under stress 
and negative values indicate a decre- 
ment. 

In Table 4 are shown these mean 
stress scores for Ss of the experimental 
group in the lowest and highest quar- 
tile in grade point average. The Ss 
with high academic standing did sig- 
nificantly better under stress than did 
the low group. Also, the Ss in the 
high group were more homogeneous in 
their performance under stress as is 
shown by the smaller variance. 


Discussion 


It must be recognized that the data 
which have been obtained are specific 
to the situation (task, instructions, 
stress inducement, and Ss) until 
proved otherwise. Moreover, the 
data do not provide information on 
the “why” of the effects measured. 
The performance of some Ss may im- 
prove because of greatly increased 
motivation aroused by the increased 
likelihood of failure. Or, in some 
cases, the motivation may become so 
great as to be disruptive through 
anxieties which are generated or ac- 
centuated by the situation and which 
act as an added distraction. It is 
also possible that some Ss did much 
worse on Test 2 because they became 
disgusted by the whole situation, or 
so threatened that their response was 
to give up or to “psychologically 
leave the field.” 


Our data have suggested that Ss with low 
grade point averages are more likely to show im- 
pairment and greater variability of performance 
under stress. Since merely telling people that 
they are taking an important test has stressful 
consequences, a certain number of Ss may be 
doing poor college work because of their inability 
to maintain the performance of which they are 
capable under such stressful circumstances. 
Moreover, as a group, Ss with low grade point 
averages might be expected to be more variable 
since adequate performance under stress is not 
the sole determiner of academic success. How- 
ever, an alternative explanation for these findings 
is possible. Ss who have been doing well in 
college may not be particularly threatened by 
the experimental situation. Their past success 
could have given them confidence in their abil- 
ity. However, interviews with a number of Ss 
after the experiment lead us to believe that our 
original explanation is the more likely. Even 
the best Ss reported very strong subjective ex- 
periences of stress. 

In any case, our results do indicate that, at 
least in the present situation, stress affected Ss 
differently. It causes some Ss to do better and 
others to do more poorly. Simply comparing 
the means of a control and experimental group 
may obscure or allow us to overlook this most 
important finding. 
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It is interesting to note that the data in Table 
1 show that the variability of the control group 
markedly decreases on Test 2 whereas that of the 
experimental group shows a large increase. We 
might think of Test 1 as moderately stressful 
for both the control and experimental groups 
(instructions specified that the results of this 
testing would be entered into the college record). 
For the control group Test 2 could be considered 
reassuring because they had been told that they 
had done well on Test 1. For the experimental 
group Test 2 was taken with the impression that 
they had failed Test 1. In view of the nature 
of these test conditions, we might make the as- 
sumption that we had produced three degrees of 
stress: mild, moderate, and severe. If this is 
the case we find that the group variability in- 
creases with the degree of stress. This notion 
needs further experimentation. 

The situation and task we used were such as 
to produce an increase in errors under stress. 
Frequently subjects do well under stress by in- 
creasing their speed at the expense of accuracy. 
In this particular situation, lowered accuracy 
mattered very little since it was so poorly cor- 
related with total scores. Undoubtedly, how- 
ever, there are situations where losses in ac- 
curacy would be serious. Lindsley (9) finds this 
kind of reaction to stress but gives it the ques- 
tionable interpretation of loss in efficiency. One 
of the general effects of stress may be to push 
Ss to increase both their speed and errors, even 
when this is not the most efficient thing to do. 
Much more work needs to be done on this ques- 
tion. 

Merely telling people that they are taking a 
test which will have an important bearing upon 
their welfare is apt to be a stressful situation. 
If this is so it has important consequences for 
testing. Hutt’s (8) clinical findings with intel- 
ligence testing are added evidence on this point. 
This kind of situation may, however, be useful 
in predicting which people will have difficulty 
with stress situations and which will profit from 
a degree of stress. 


SUMMARY 


The effects of a failure threat upon 
the performance of college students on 
an extended version of the Wechsler- 
Bellevue digit-symbol subtest were 
studied. The main results were: 


1. A significant increase in inter- 
individual variability in test perform- 
ance occurred in the stress group. 

2. An increase in the number of 
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errors made under stress was com- 
pensated for to some extent by an 
increase in speed. Speed scores ac- 
counted for 92% of the variance in 
total scores. 

3. Students with high grade point 
averages showed a tendency to im- 
prove their performance under stress. 
Students with poor academic standing 
did more poorly and were much more 
variable. 


(Manuscript received April 20, 1951) 
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RESPONSE STRENGTH IN A MODIFIED THORNDIKI# J 
MULTIPLE-CHOICE SITUATION AS A FUNCTION OF 
VARYING PROPORTIONS OF REINFORCEMENT 


ALBERT E. GOSS AND EDWARD J. RABAIOLI 
University of Massachusetts 


The importance of the frequency and 
pattern of reinforcements as parame- 
ters which influence the acquisition 
and extinction of responses has been 
emphasized increasingly in recent ex- 
perimental (3, 4, 5, 6, 7, 15, 17) and 
theoretical (8, 10, 14, 16, 18) litera- 
ture. However, experimental studies 
have been restricted to a consideration 
of either classical and instrumental 
conditioned responses or simple choice 
reactions as functions of varying pro- 
portions of reinforcement. Further, 
with the exception of a study by 
Brogden (1), there has been little 
interest in the more general problem 
of response strength as affected by 
varying degrees of post-acquisition 
changes in the proportions of rein- 
forcement. Accordingly it was the 
purpose of the present study to in- 
vestigate (a) acquisition of the rela- 
tively complex responses required in a 
modified Thorndikian multiple-choice 
situation as a function of varying pro- 
portions of randomly administered 
reinforcement, and (b) modification 
in the strength of already-acquired 
choice responses as a function of 
changed reinforcement proportions, 
specifically, the complete elimination 
of reinforcement and the reduction of 
the proportions of reinforcement to 
one-half of the corresponding acquisi- 
tion rates. 

Straightforward extension of stimu- 
lus response reinforcement principles 
(10, 19) of the relationship between 
response strength and varying pro- 
portions of reinforcement for simple 
or single-stimulus-unit learning situ- 


ations to the Thorndikian situation 
(19, 20, 21, 22) generates the predic- 
tion that rate of acquisition as meas- 
ured by trials or correct responses to 
criterion will be some direct function 
of the proportion of primary and/or 
secondary reinforcements. It is 
doubtful, however, that S-R rein- 
forcement principles for single units 
should be extended without modifica- 
tion to situations in which each trial 
consists of the serial and relatively 
massed intra-trial presentation of 
several stimulus units. Specifically, 
it would appear necessary to supple- 
ment such principles by an analysis 
of possible intra-trial and, to a lesser 
degree, inter-trial interactions among 
the several stimulus units in terms of 
principles of response-produced stimu- 
lation, stimulus compounding and pat- 
terning, and stimulus generalization 
(10, 15, 17, 18). Therefore, in addi- 
tion to the proportion of reinforce- 
ments, variables such as the number 
and composition of the stimulus units 
and the time intervals between units 
and between trials would seem rele- 
vant. Further, since a source or 
sources of secondary reinforcement 
may develop on the basis of and in 
addition to the original secondary re- 
inforcers of human learning tasks (for 


1 Humphreys’ (11, 12) expectancy theory has 
not been developed in a form that permits fairly 
rigorous experimental tests. Further, it was 
difficult to decide whether or not Tolman (23) 
would consider reinforcement of relevance for 
response strengthening in a modified Thorn- 
dikian situation. Accordingly, the derivation 
of predictions was within the framework of 
S-R reinforcement principles. 
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exymple, E’s “right’) the possible 
influence of this factor should also be 
assessed. 

Although necessary considerations 
for a complete theory of the relation- 
ship between learning and the propor- 
tion of reinforcement, such analyses 
appear premature in view of the lack 
of related data on the level of complex 
responses. Bearing in mind the neces- 
sity for future refinements, the pres- 
ent study is considered to be an ex- 
ploration of the more general hypothe- 
sis stated above, namely, that rate of 
learning increases as some direct func- 
tion of the proportion of reinforce- 
ment. 

The derivation of predictions of the 
effects of changed reinforcement con- 
ditions also rests upon considerations 
regarding the influence of additional 
sources of secondary reinforcement 
and the role in stimulus unit inter- 
action of principles of response-pro- 
duced stimulation, stimulus com- 
pounding, and stimulus generalization. 
Elimination of reinforcement will re- 
sult in removal from the stimulus 
compounds antecedent to each choice 
reaction of stimuli produced by re- 
sponses to reinforcers. Also, the 
stimulus effects of responses to non- 
reinforcement would be added to the 
compounds. Therefore, the increas- 
ing dissimilarity of acquisition and 
changed reinforcement stimulus com- 
pounds resulting from changes in the 
components of the compounds should 
lead, by the principle of generalization 
(10), to a decrement in the number of 
correct responses. Also, since the 
degree of dissimilarity is presumably 
greatest for high acquisition reinforce- 
ment proportions and least for low 
acquisition reinforcement proportions, 
a direct relationship between size of 
decrements and the proportion of ac- 
quisition reinforcements would be 
anticipated. 


In the case of halved reinforcement 
conditions the effects of increased dis- 
similarity of acquisitions and changed 
reinforcement stimulus compounds 
should be counterbalanced in part by 
continued reinforcement, although at 
reduced rates. Accordingly, while 
dissimilarity of pre-choice stimulus 
compounds might occasion initial dec- 
rements in the frequency of correct 
responses, the influence of the latter 
factor should eventually check and 
reverse those decrements. Because 
of continued reinforcement and fur- 
ther because of the relatively smaller 
changes in compound similarity, some- 
what smaller decrements for halved 
than for elimination of reinforcement 
conditions would be predicted. 


METHOD 


Subjects—Ninety-two men and women from 
University of Massachusetts undergraduate psy- 
chology classes volunteered to serve as Ss. Be- 
cause the experiment involved color discrimina- 
tion, data for two color-blind Ss were eliminated, 
thus leaving 90 Ss. 

Apparatus and stimulus materials.—A memory 
drum was used to present eight modified multi- 
ple-choice units (19, 20, 21, 22) each of which 
consisted of three nonsense syllable choice- 
stimuli underscored with a narrow strip of 
Milton-Bradley Bulls-Eye construction paper. 
The 24 syllables had been selected at random 
from a subgroup of every other one of Witmer’s 
(24) consonant nonsense syllables of 8, 13, 17, 
21, and 25% association value. They were as- 
signed at random to any one of the eight units, 
subject to the restriction that each unit consist 
of syllables of approximately equal association 
value and have no letters in common. The 
three syllables of each unit were typed in capital 
letters on a single line and underscored with one 
of eight 14 X }-in paper strips of different and 
easily discriminable colors. 

Since Thorndike and his co-workers had not 
randomized the positions of choice stimuli, 
choice positions within each unit were not varied 
in the present study.? In order to minimize the 
influence of initial position preferences on cor- 


2 The color stimuli were introduced to make 
the stimulus units more distinctive. For a 
given S, correct responses for a given unit could 
be based entirely on syllable position. 
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TABLE 1 
Summary or ExpERIMENTAL PRocEDURE 
on oy Aw 
Group N | iment Pro. | ment Pro- 
portions portions 

100-0 15 100 0 
100-50 15 100 50 

75-0 15 75 

75-37.5 15 75 37.5 

50-0 15 50 

50-25 15 50 25 














rect syllable selection three sequences of syllable- 
reinforcement conditions were prepared and 
assigned to Ss in such a manner that (a) for 
each of the three sequences there were choices 
at each of the three possible syllable positions, 
and (b) each unit’s three syllable-position com- 
binations were correct for one-third of the Ss of 
each experimental group. Possible serial learn- 
ing effects were minimized by the use of two 
randomly determined orders of the same eight 
units. 

Presentation of the units was at a 3-sec. rate 
with a 10-sec. inter-trial interval. 

Procedure-—The experimental design is out- 
lined in Table 1. The Ss were randomly as- 
signed to one of the six conditions as they re- 
ported to Z. After being tested for color-blind- 
ness by means of the Ishihara cards, they read 
appropriate instructions along with £. 

During the acquisition sequence Ss of each 
group were required to select and to call out the 
letters of one of the three syllables of each unit. 
Correct choices as designated by E were rein- 
forced immediately by means of a buzzer. The 
acquisition trials were terminated when Ss at- 
tained a criterion of 15 of 16 correct choices on 
two successive trials. Details of the acquisition 
reinforcement procedures for each group were as 
follows: 

100% reinforcement.—The buzzer sounded 
each time S selected and called out the cor- 
rect syllable. ; 

75% reinforcement.—The sequence of 
buzzer occurrences for correct choices was 
randomly predetermined in such a manner 
that on a given trial six of eight of the cor- 
rect choices would be reinforced and for 
each successive block of four trials three of 
four of the correct choices for a given stimu- 
lus unit would be reinforced. 

50% reinforcement.—Subject to the re- 
striction of no more than two successive pos- 
sible reinforcements or non-reinforcements 
for correct choices for a given stimulus unit, 


the sequence of buzzer occurrences was ran- 
domly predetermined in such a manner that 
on a given trial four of eight of the correct 
choices would be reinforced and over several 
trials 50% of the correct choices for a given 
stimulus unit would be reinforced? 

Without further instructions to S, the 20 
trials under changed reinforcement conditions 
were initiated immediately following the second 
criterion trial. Reinforcement for correct re- 
sponses was completely eliminated for Groups 
100-0, 75-0, and 50-0. The proportions of 
randomly predetermined reinforcements for each 
trial and for the correct syllable of each unit 
were reduced from 100 to 50%, from 75 to 37.5%, 
and from 50 to 25% for Groups 100-50, 75-37.5, 
and 50-25 respectively. 

Both correct responses and any incidental 
remarks made by S during the experiment were 
recorded. Before leaving the experimental situ- 
ation S was given a questionnaire which was 
designed to elicit supplementary information re- 
garding both bases of syllable choice and reac- 
tions to changed reinforcement conditions. 


RESULTS 


Acquisition—Means and SD’s of 
trials, correct responses and rein- 
forced correct responses to the cri- 
terion for the six conditions are pre- 
sented in Table 2. Statistical analy- 
sis by means of the ¢ and F tests indi- 
cated that the differences between the 
means and variances of trials, correct 
responses, and reinforced correct re- 
sponses to the criterion for each of the 
three pairs of groups which had had 
the same acquisition reinforcement 
proportions were not statistically sig- 
nificant. Accordingly, data for the 
two 100%, the two 75%, and the two 
50% groups were pooled. These 
data are also summarized in Table 2. 

Tests of the hypothesis of no 
differences among mean trials and 
mean correct responses to criterion 


for the combined 100, 75, and 50% 


* The present procedure, which involves vari- 
ation in the proportion of reinforcements for a 
single unit over a series of trials, should be dis- 
tinguished from the procedure of varying the 
proportion of trials on which all responses were 
reinforced or non-reinforced. It is planned to 
use the latter procedure in a subsequent study. 
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TABLE 2 


Triats, Correct Responses, AND Retnrorcepy Correct Responses TO CRITERION FOR 
Eacu AcquisiT1Ion-CHANGED REINFORCEMENT Group AND FoR PooLep Groups 
wuicH Hap THe Same AquisiITION REINFORCEMENT PROPORTIONS 
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Trials to Criterion Come be rams yon sew Senpenaes 

Group 
M SD M SD ', SD 
100-0 22.8 5.8 98.5 25.8 98.5 25.8 
100-50 21.7 9.5 91.7 33.1 91.7 33.1 
75-0 33.1 9.4 145.7 36.5 108.9 27.4 
75-37.5 37.8 6.0 162.5 39.8 120.2 29.1 
50-0 42.9 7.6 179.6 37.2 87.2 19.3 
50-25 42.0 11.4 176.7 47.4 87.1 23.5 
100 22.3 7.9 95.1 29.8 95.1 29.8 
75 35.4 8.2 154.1 39.0 114.5 28.8 
50 42.4 9.7 178.1 42.6 87.1 21.5 

reinforcement conditions yielded and Campbell (9) modification of 


F-ratios of 28.8 and 37.47 respec- 
tively. Since both values were sig- 
nificant at beyond the 1% level of 
confidence for 2 and 87 df, it may be 
concluded that whether measured in 
terms of trials or correct responses to 
criterion, the 100% or continuous re- 
inforcement group was superior to the 
75% group whose performance in 
turn exceeded that of the 50% group. 
The F-value of 7.84 for means of 
reinforced correct responses to cri- 
terion was also significant (p < .01; 
df = 2,87). However, mean differ- 
ences for this measure were less strik- 
ing and, because the 50% condition 
required the smallest number of rein- 
forced correct responses, in different 
directions than condition differences 
for trials and correct responses. 
Since there were within-group and 
between-group differences in correct 
responses to the criterion, the Hilgard 


4 Bartlett’s x? test (2) was utilized to test the 
homogeneity of the variances of trials, correct 
responses, and reinforced correct responses. 
Since the x? values of 1.44, 3.10, and 3.50 for 
trials, correct responses, and reinforced correct 
responses, respectively, did not reach the 5% 
level (df = 3), the hypothesis of no differences 
among group variances could not be rejected. 


Vincent’s procedure was utilized to 
determine and to compare the learn- 
ing functions of the three groups. 
When per cent of total correct re- 
sponses within successive tenths of 
trials to criterion was plotted as a 
function of successive tenths to cri- 
terion, visual inspection of the three 
resultant curves suggested that they 
could be represented by the same ap- 
proximately linear function. 

Changed reinforcement.—Figure 1, 
whose ordinate values are based on 
the single trial maximum of eight cor- 
rect responses, is a graphic representa- 
tion of mean correct responses for 
each of the six conditions for the two 
acquisition criterion trials and for five 
successive four-trial blocks under 
changed reinforcement conditions; the 
points labeled “totals” represent mean 
correct responses for the 20 changed 
reinforcement trials as a whole divided 
by 20. Inspection of the six curves 
reveals initial downward trends from 
criterion values to Block II which are 
somewhat greater for the elimination 
of reinforcement conditions; these 
decrements are followed by upward 
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CRITERION BLOCKS TOTAL 
Fic. 1. Mean correct responses per trial for criterion trials, five successive 
four-trial blocks, and for the 20 changed reinforcement trials as a whole 
somewhat positively accelerated ments from criterion to Blocks I and 


trends from Block II to Block V. 

The one-tailed ¢ was utilized to test 
the null hypothesis with respect to 
intra-group decrements from criterion 
mean values to means for totals and 
to means for Blocks I, II, and V. 
Since the #’s for decrements from cri- 
terion to total means, which ranged 
from 2.42 to 6.01, were significant at 
from the 3% to beyond the 1% levels, 
it was concluded that for each of the 
six changed reinforcement conditions 
there was a significant reduction from 
criterion levels in the mean frequency 
of correct responses for the 20 trials 
as a whole. With the exception of 
the criterion—Block I comparison for 
the 75-37.5 group, statistically sig- 
nificant (#’s from 2.12 to 8.36 with 
corresponding ’s ranging from the 
4% to beyond the 1% level) decre- 


5 Because the small variances of the criterion 
trial scores led to marked departures from the 
homogeneity of the variances of the distributions 
which were compared, p values for the preceding 
and subsequent ?’s were based on 7 df (2). 


II were obtained. That these dec- 
rements were not permanent, how- 
ever, is suggested by the less signifi- 
cant differences (t’s from 1.44 to 3.00 
with corresponding p’s from 12% to 
1%) between criterion and Block V 
means. Also, although the _ incre- 
ments were not statistically signifi- 
cant, the fact that with the exception 
of the 75-37.5 group, Block V means 
for each group were higher than cor- 
responding values for Block I adds 
support to this suggestion. 

Analysis of variance for related 
measures (2) was utilized to test the 
hypothesis of no differences among the 
five four-trial block means for each of 
the six groups. Since the F-ratios 
for Groups 100-50, 75-0, 50-0, and 
50-25 (3.75, 3.17, 10.72, and 4.15, 
respectively) were significant at from 
the 5% to beyond the 1% level for 
4 and 56 df, it was concluded that the 
observed downward-upward trends 
for these conditions could not be at- 
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tributed to chance fluctuations. The 
F’s for the two remaining groups were 
not significant.® 

The null hypothesis with respect to 
mean correct responses for the 20 
trials as a whole (totals) and to four- 
trial block trends for the six combina- 
tions was also tested by means of an 
analysis of variance technique (2).? 
Failure to obtain a significant F 
(Table 3) for the ratio of the between- 
group totals variance to the variance 
estimate based upon Ss of the same 
group suggests that the six groups did 
not differ significantly in mean num- 
ber of correct responses for the 20 
changed reinforcement trials. The 
ratio of the variances based upon the 
interaction of methods and trials and 
the interaction of pooled Ss and trials 
permits a test of the hypothesis of no 
differences among the four-trial block 
changed reinforcement trends for the 
six groups. The F of .809 which was 
computed from this ratio was con- 
sistent with the hypothesis of no 
trend differences, that is, it indicated 
that the changed reinforcement curves 
for the six groups had similar forms. 

Questionnaire.—The questionnaire 
results indicated that a similar number 
of Ss in all groups tended to use color 
and/or position and letters of the 
syllable as the bases of choice re- 
actions. Spontaneous incidental re- 
marks made by a few Ss during 


®Since the x? values for the 75-37.5 and 
50-25 groups permitted the rejection of the hy- 
pothesis of homogeneous variances at the 1% 
level, the F’s for these groups should be evaluated 
conservatively. It should be noted, also, that 
because the x? values were based upon correlated 
measures, an underlying assumption was vi- 
olated. 

7 Bartlett’s test yielded x* values of 18.49 for 
totals (df = 5), and 115.73 for the 30 four-trial 
block values (df = 29) which were significant at 
the 1% level. Also, it should be commented 
that, contrary to underlying assumptions, the 
x? values for the trial and block variances were 
based upon correlated measures. 


TABLE 3 


ANALYsIs OF VARIANCE oF CoRRECT 
RESPONSES UNDER CHANGED 
REINFORCEMENT PROPORTIONS 








Source of Variation df | Variance F 
Between-group totals 5 | 984.07 | 1.61 
Between-subjects in 

same group 84 | 114.55 
Total between subjects 89 
Between four-trial blocks 4] 171.96 | 21.05* 
Groups X trials 20 6.61 | 0.81 
Pooled subjects X trials | 336 8.17 
Total within-subjects 360 
Total 449 














* Significant at the 1% level of confidence. 


changed reinforcement trials, as well 
as questionnaire results, indicated that 
some Ss in all groups recognized cor- 
rectly that changes in the rate of 
buzzer presentation had occurred. 
Specifically, 42 of 45 Ss in the three 
elimination of reinforcement groups 
and 33 of the Ss of the halved rein- 
forcement groups were able to rec- 
ognize the changes correctly. A x? 
test of independence of degree of 
change and recognition of change 
yielded a value of 6.48 which is sig- 
nificant at the 2% level for 1 df. It 
was concluded, therefore, that the 
halved reinforcement groups did not 
recognize the changes in rate as 
readily as the elimination of reinforce- 
ment groups. 


Discussion AND CONCLUSIONS 


Acquisition.—Proceeding from the 
stimulus-response reinforcement con- 
ception of response strength as a con- 
tinuous and cumulative function of 
the number of reinforcements, it was 
predicted that in a modified Thorn- 
dikian situation rate of acquisition as 
measured by the number of trials 
and/or correct responses to criterion 
would be a direct function of the pro- 
portion of reinforcements. The ac- 
quisition data appear to be consistent 
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with this prediction, with, specifically, 
the 100% groups requiring approxi- 
mately three-quarters and one-half as 
many trials and correct responses to 
criterion as the 75% and 50% groups, 
respectively. Also consistent with 
the conception of the continuous and 
cumulative influence of the number of 
reinforcements was the finding that 
the Vincent curves for the three ac- 
quisition reinforcement schedules were 
similar in form. Pertinent to and 
qualifying the preceding conclusion, 
however, is a reiteration of the neces- 
sity for explicitly incorporating prin- 
ciples of the interaction of stimulus 
units into a more complete systema- 
tization of learning in the Thorndikian 
situation under varying schedules of 
reinforcement and conditions of unit 
presentation. 

Changed reinforcement proportions. 
—The curves obtained for changed 
reinforcement conditions exhibited ini- 
tial downward trends from criterion 
levels to the second four-trial block 
which were followed by upward some- 
what positively accelerated trends; in 
general, the observation of the down- 
ward-upward curve forms was sup- 
ported by statistical analyses. The 
apparent positive acceleration of the 
upward trends when combined with 
the findings of higher means for Block 
V than for Block I suggests not only 
that the initial decrements were not 
permanent but also that had <ddi- 
tional reinforcement trials been given, 
all groups might have reached correct 
response levels which did not differ 
significantly from corresponding cri- 
terion values: It is possible, of 
course, particularly in the case of the 
elimination of reinforcement groups, 
that additional changed reinforcement 
trials would have led to a second 
downward trend. 

Contrary to pre-experimental pre- 
dictions, however, were the reversals 
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of the initial downward trends of the 
elimination of reinforcement groups; 
a finding which raises a problem of 
considerable theoretical interest, 
namely the identification of a source(s) 
of continuing reinforcement for the 
presumably non-reinforced choices of 
these groups. In this connection it 
would appear possible that during the 
acquisition trials the correct syllables 
themselves had become secondary re- 
inforcers and therefore, because in- 
sufficient non-reinforcement trials had 
been given to extinguish their secon- 
dary reinforcing properties, continued 
to reinforce correct choices. How- 
ever, the essentially linear relationship 
between trials and errors to criterion 
and the proportion of reinforcement 
as well as the similar Vincent curves 
are in opposition to the notion that 
the correct syllables acquired rein- 
forcing properties during the acquisi- 
tion trials. 

A second possibility is based on the 
assumption that during changed rein- 
forcement trials the selection of other 
previously incorrect but also non-rein- 
forced choices resulted in greater 
frustration than the continuation of 
the previously correct choices. Ac- 
cordingly, the hypothesis is advanced 
that as changed reinforcement trials 
continued, failure to select previously 
incorrect choices actually occasioned 
a relative reduction in frustration 
which served to reinforce the correct 
choices of the acquisition trials. 

A second problem of theoretical 
interest stems from the Grant, Rio- 
pelle, and Hake (5) suggestion. that 
the trend reversal which they ob- 
served could be interpreted as com- 
pensation for response suppression 
during the acquisition trials. While 
the requisite identifications of a 
source of suppression and of sup- 
pressed responses in the multiple- 
choice situation has as yet not been 
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possible, this explanation of the up- 
ward trend may prove relevant to the 
present data. 


SUMMARY 


Ninety undergraduate Ss were re- 
quired to learn a list of eight modified 
multiple-choice units to a criterion of 
15 correct of 16 choices. The units 
of the list consisted of a colored paper 
stimulus and three nonsense-syllable 
choice stimuli and were presented on 
a memory drum at a 3-sec. rate. A 
buzzer served as the “reinforcer” for 
correct responses. The Ss were ran- 
domly assigned to one of six numeri- 
cally equal groups. During the ac- 
quisition trials, correct choices for two 
groups were reinforced 100% of the 
time, two other groups were given re- 
inforcement for 75% of correct re- 
sponses, and the two remaining groups 
received 50% reinforcement. Upon 
completion of the acquisition trials, 
reinforcement was halved for one 
of the 100% acquisition groups and 
eliminated for the other 100% group. 
The same procedure was followed for 
the two 75% acquisition groups and 
for the two 50% groups. All groups 
were given 20 trials under the changed 
reinforcement conditions. 

The mean trials to criterion of 22.3, 
35.4, and 42.4 for the pooled 100%, 
75%, and 50% acquisition conditions, 
respectively, and the parallel relation- 
ship among mean number of errors to 
criterion, were in accordance with the 
direct relationships between response 
strength and reinforcement propor- 
tions generally observed for condi- 
tioned or simple choice responses and 
thus supported the S-R reinforcement 
conception of the continuous and 
cumulativé influence of the number of 
reinforcements. The downward-up- 
ward trends which resulted from 
halving the proportions of reinforce- 
ment were in line with the notion of 


decrements resulting from dissimilar- 
ity of acquisition and changed rein- 
forcement stimulus patterns which 
were offset by continuing reinforce- 
ment. The downward-upward trends 
of the groups for which reinforcement 
was eliminated were not predicted. 
It was hypothesized that these trend 
reversals might have reflected the 
strengthening effects of unanticipated 
but continuing sources of secondary 
reinforcement. 


(Manuscript received April 23, 1951) 
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MOTIVATION IN LEARNING: XI. AN ANALYSIS OF 
ELECTRIC SHOCK FOR CORRECT RESPONSES 
INTO ITS AVOIDANCE AND ACCELERATING 

COMPONENTS ! 


KARL F. MUENZINGER, WILLIAM O. BROWN, WAYMAN J. CROW, 
AND ROBERT F. POWLOSKI 


University of Colorado 


In studies of electric shock for cor- 
rect turns, it was found repeatedly 
that although learning of a black- 
white discrimination by white rats 
was definitely facilitated by shock for 
correct turns it was never quite as 
efiective as shock for wrong turns (2, 
p. 181). This is so in a corrective 
situation where an animal is per- 
mitted to retrace the wrong alley and 
enter the right one. In a non-cor- 
rective situation, where an animal is 
not permitted to retrace after enter- 
ing the wrong alley, the accelerating 
effect of shock for correct turns seemed 
slight indeed (3), if not absent, as 
claimed by George J. Wischner (4). 
In attempting to account for this dif- 
ference between the effects of shock 
for correct turns in a corrective znd a 
non-corrective situation Muenzinger 
and Powloski postulated “that shock 
after the point of choice whether 
for right or wrong turns in a visual 
discrimination box has an accelerating 
function in both situations, non-cor- 
rective as well as corrective, but in 
addition it also has an avoidance 
function which may under certain 
circumstances mask the accelerating 
effect. Shock after the point of 
choice makes the animal more sensi- 
tive to the cues to be discriminated, 
but it also creates a tendency in an 
animal to avoid the alley in which it 


1The authors wish to thank the Council on 
Research and Creative Activity of the Univer- 
sity of Colorado whose financial assistance made 
this study possible. 


has been shocked. If it should be 
possible to train an animal to adapt 
itself to, and thus overcome, the avoid- 
ance effect it might then exhibit the 
accelerating effect more clearly” (3, 
p. 123). 

In order to test this hypothesis, 30 
white rats received preliminary train- 
ing with electric shock in a straight- 
away, 30 rats received such training 
without shock, and 30 rats received 
no preliminary training at all, before 
they all learned a black-white dis- 
crimination in a non-corrective situ- 
ation. In each of the three groups of 
30 Ss, 10 learned the discrimination 
without shock (no-shock), 10 with 
shock for wrong turns (shock-wrong), 
and 10 with shock for correct turns 
(shock-right). According to the hy- 
pothesis the shock-right Ss that had 
preliminary training with shock should 
now show a learning record approach- 
ing that of the shock-wrong Ss. In 
order to answer the question whether 
this expected result would actually be 
due to the preliminary shock experi- 
ence or merely to the preliminary 
training in the straightaway, the addi- 
tional subgroup with such training, 
but without shock, was needed. 


ConpDITIONS AND MeETHops 
oF TRAINING 


The apparatus, a T-shaped discrimination 
box, has been described in the preceding article 
(3). In addition, a straightaway was used, 15 
in. long and 3 in. wide, with a starting box at 
one end and a goal box at the other. The floor 
of the middle section consisted of two solid 
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copper strips 15 in. long and slightly less than 
1} in. wide. They were about } in. apart at the 
center of the alley and formed a shallow trough 
so that any feces would drop through. They 
functioned as an electric grid. 

Ninety white rats of Sprague-Dawley strain 
were divided into nine groups as indicated in 
Table 1. General handling and feeding were 
also the same as in the preceding experiment 
(3). The rats were motivated by a 23-hr. food 
deprivation. 

Six of the nine groups were given preliminary 
training in the straightaway for 12 days of ten 
runs each. At the end of each run S was re- 
warded with food in the goal box. The first 
four runs of the first day were used to accustom 
S to the apparatus. It was first placed in the 
goal box and then farther and farther away and 
finally with the fifth run S was placed in the 
starting box and allowed to proceed to the goal 
box. 

From the second to the twelfth day three of 
the groups were run without change of condi- 
tions. They constitute the 30 “preliminary 
training without shock” Ss which were treated 
differentially only in the discrimination box 
where one group of 10 Ss received no shock at 
all (Group Id), 10 received shock for wrong 
turns (Group IId), and 10 for correct turns 
(Group IIId). 

The other 30 Ss, referred to as having “‘pre- 
liminary training with shock,” received shock in 
the straightaway at every other trial beginning 
with the second trial on the second day, or the 
twelfth trial counting from the beginning. The 
strength of the electric current was gradually 


TABLE 1 


Non-Corrective Learninc ConpiTIONs 
(N = 10 for each group) 











ow | Tae | ae 
Ib* none none 
IIb* none for wrong turn 
IIIb* none for correct turn 
Ic with shock none 
IIc with shock for wrong turn 
IIIc with shock for correct turn 
Id without shock none 
IId without shock for wrong turn 
IIId without shock for correct turn 











* Groups Ib, IIb, and IIIb are the same as those with 
similar labels in the preceding experiment (3). 

2 Credit for suggesting this simple and effec- 
tive type of grid should go to Dr. Karl Zener of 
Duke University to whom the authors hereby 
express their thanks. 
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increased from 30 to 100 microamperes from 
Trial 12 to Trial 42, and then 20 microamperes 
per day reaching 240 on the last day. In the 
discrimination box, where the current was never 
above 100 microamperes, the 30 Ss were also 
treated differentially, 10 receiving no shock 
(Group Ic), 10 shock for wrong (Group IIc), and 
10 shock for correct turns (Group IIIc). 

Three of the nine groups (Ib, IIb, and IIIb) 
were given no preliminary training in the 
straightaway. They were the same as the 
groups with similar labels in the preceding ex- 
periment (3). 

The handling of Ss in the discrimination box 
was exactly the same as described in the preced- 
ing article, including the two days of accustoming 
them to the apparatus. The lighted panel was 
again the positive cue, and the criterion of learn- 
ing consisted of two successive errorless daily 
runs of ten trials each. The method was non- 
corrective, that is, an S having made a wrong 
turn was not permitted to retrace, but had to 
proceed forward to the starting point, from there 
again to the choice point, and so on. 

No £ handled more than five Ss in any one 
group. The senior author was responsible for 
planning the experiment and writing this report. 


RESULTS 


The results are stated in terms of the 
mean number of “reinforcements” and 
“wrong turns” to reach the criterion. 
This terminology, replacing the usual 
terms “trials” and “errors,” was 
adopted for reasons given elsewhere 
(3). 

Our major concern was the question 
whether the accelerating effect of 
electric shock for correct turns in the 
non-corrective situation would show 
itself clearly if S had by previous train- 
ing learned to overcome the avoidance 
effect of shock. The results show that 
this is so. Without previous experi- 
ence with shock the shock-right group 
is significantly poorer in terms of re- 
inforcements and wrong turns than the 
shock-wrong group, as shown in com- 
paring Groups IIIb and IIb. How- 
ever, given such experience, the differ- 
ence between the shock-right and 
shock-wrong groups becomes insig- 
nificant, as shown by comparing 


Groups IIIc and IIc. 
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TABLE 2 


Mean REINFORCEMENTS AND Wronc Turns PER S To THE CRITERION 

















No Pre-Training Pre-Training with Shock Pre-Training without Shock 
Deataae | o Reinf Wes 1a Reinf Wrong ic Reinf yume 
‘on ten p. einf, Turns p. einf. Turns Dp. einf. eres 
M | SD M | SD M |SD|M | SD M | SD M | SD 
No-shock Ib] 156 | 50.0 | 41.9} 12.2 Ic} 195 | 80.5 | 61.2 | 29.6 Id] 164 | 46.5 | 47.2 | 19.5 
Shock-wrong | IIb} 78 | 14.7}19.2| 3.0] IIc} 72]|13.3]17.2] 6.8] IId| 69}|13.0/ 13.3] 5.1 
Shock-right | IIIb} 120 | 39.0} 33.0] 6.9} IIIc] 82 |28.2|22.0] 7.8| IIId} 114 | 27.6|33.2| 7.2 


















































Another way of demonstrating the 
accelerating effect of shock-right is by 
comparing the shock-right group 
which had experience with shock prior 
to the discrimination box, and the 
other shock-right groups that had not. 
In terms of both reinforcements and 
wrong turns, Group IIIc, having had 
pretraining with shock, is significantly 
better than Groups IIIb and IIId 
which had their first experience with 
shock in the discrimination box. 

Since it seemed possible that pre- 
training alone in the straightaway 
might have produced these results, 
even if no shock had been present 
during this phase of the experiment, a 
set of Ss was given pretraining without 
shock in the straightaway before they 
were run in the discrimination box. 
The results in terms of reinforcements 
and wrong turns show that the shock- 
right groups with and without such 
pretraining do not differ significantly, 


TABLE 3 


p-VaLues oF DIFFERENCES BETWEEN MEANS 
or Boru REINFORCEMENTS AND 
Wronc Turns * 











Group IIb IIc IId IIIc IIId 
IIIb | <.01° <.01 | >.05 
IIIc >.05 

IlId <.01 























* These values were obtained by Festinger’s method 
for skewed samples (1). 


as seen in comparing Groups IIIb and 
IIId. 

And again, with no previous shock 
experience, although with pretraining 
in the straightaway, the shock-right 
group is significantly poorer than the 
shock-wrong group, as seen in com- 
paring Groups IIId and IId, whose 
records closely resemble those of 
Groups IIIb and IIb, the original 
shock-right and shock-wrong groups 
without any pretraining. 


Discussion 


The close resemblance of the learn- 
ing efficiency of a shock-right group 
in a non-corrective situation to that of 
a shock-wrong group when both have 
been given previous experience with 
shock as compared with the significant 
difference between similar groups 
without such experience can have only 
one meaning: shock in this learning 
situation has essentially two effects, 
avoidance and acceleration. 

If an animal is shocked when it 
makes a correct turn this experience 
first tends to bring about an avoid- 
ance of shock. It is only after the 
tendency to avoid the shock has been 
somewhat overcome that the acceler- 
ating function of shock after the point 
of choice can manifest itself unhin- 
dered. If the first experience with 
shock occurs in the discrimination 
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box and the tendency to avoid it has 
to be overcome there, the course of 
learning by shock for correct turns 
will be slowed down as shown in the 
records of Wischner’s Ss and of our 
Groups IIIb and IIId. If, however, 
the tendency to avoid shock has been 
overcome by experience with shock 
prior to running the discrimination 
box, the accelerating effect of shock 
shows itself more immediately, and 
learning by shock for correct turns, as 
in Group IIIc, appears to be practi- 
cally as accelerating as shock for wrong 
turns. 

It might be asked whether pretrain- 
ing with shock has a similar result in a 
corrective situation. One could 
hardly expect, however, to obtain 
more of an accelerating effect of shock 
for correct turns where it is already 
clearly manifested. Without pre- 
training the shock-right method pro- 
duces a learning efficiency somewhat 
poorer than the one produced by the 
shock-wrong method, but the differ- 
ences are not always significant. In 
an extension of the present experi- 
ment, the details of which are not re- 
ported here, three shock-right groups 
were trained in a corrective situation, 
one with no pretraining, one with pre- 
training in the straightaway with 
shock, and one with pretraining with- 
out shock. The mean reinforcements 
were 72, 72, 70, and the mean wrong 
turns 17.0, 16.7, and 16.6 respectively. 
In other words, prior experience with 
shock had no effect in a situation 
where the accelerating effect is pretty 
much in evidence already. 

It is not possible to tell whether in 
this experiment the length of the pre- 
training in the straightaway and the 
intensity of the electric current are 
optimum values. Perhaps shorter 
pretraining and a smaller current 
might have had the same effects. 

In the straightaway shock was ad- 
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ministered only in alternate trials in 
order to simulate somewhat the suc- 
ceeding experience in the discrimina- 
tion box where, in the beginning at 
least, Ss would not experience shock 
after each choice. Again it is not 
possible to tell whether shock on every 
trial in the straightaway might not 
have been more effective. To have 
given shock on every trial in the 
straightaway might have created an 
advantage for the shock-right condi- 
tion since shock on every trial could 
have led Ss to expect food after shock. 
This might have produced a prefer- 
ence for the shock side which would 
be the correct side for the shock-right 
group but the incorrect side for the 
shock-wrong group. By feeding on 
every trial and shocking on alternate 
trials one would expect that no sys- 
tematic preference would exist for the 
shock or no-shock side in either Group 
II or III. 

It might not be amiss to point out 
that under the conditions used here 
the rats, even though at first dis- 
turbed by shock in the straightaway, 
overcame their avoidance reaction 
fairly well and were willing eventually 
to cross a grid which must have given 
them a considerable shock. The 
gradual increase in current, no less 
than the frequent exposure to it (55 
times), was probably an important 
factor in overcoming undesirable emo- 
tional effects, if such effects existed 
at the beginning, which is probable. 


SUMMARY 


The purpose of this study was to 
find out whether the accelerating 
effect of electric shock for correct 
choice (in a non-corrective situation) 
would manifest itself more readily if 
the avoidance effect of shock had been 
overcome previously. 

White rats were given 120 trials in 
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a straightaway and received shock in 
55 of the trials. They were subse- 
quently trained by the non-corrective 
method to learn a black-white dis- 
crimination. The Ss which were 
shocked for correct turns were only 
slightly poorer than those shocked 
forwrongturns. The differences were 
statistically insignificant. 

Among other Ss that had not re- 
ceived any prior experience with 
shock in the straightaway those 
trained with shock for correct turns 
were significantly poorer than those 
trained with shock for wrong turns. 

It is assumed that electric shock in 
discrimination learning has two func- 
tions: (a) producing avoidance be- 
havior, and (b) accelerating learning. 
If the tendency to avoid shock is over- 
come prior to training in the dis- 


crimination box, the accelerating func- 
tion will manifest itself unequivo- 
cally. 


(Manuscript received April 23, 1951) 
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THE EFFECT OF. DEGREE OF SHADING CONTRAST 


IN INK BLOTS ON VERBAL RESPONSE 


JOHN C. BALLOCH 
Fisk University! 


Recent years have seen a tremen- 
dous growth in the popularity of the 
so-called projective techniques. 
Among the most prominent of these 
techniques is the Rorschach method 
of personality evaluation utilizing ten 
standardized ink blots (6). 

As the Rorschach method has grown 
in importance as a clinical tool, the 
amount of importance attached to the 
shading responses as specific indicators 
of types of adjustment or maladjust- 
ment has alsoincreased. By the term 
shading responses, we mean _ those 
verbal productions which are deter- 
mined wholly or in part by the shad- 
ing nuances which appear in practi- 
cally all the blots. They are the 
responses to the stimuli along the 
black-white axis of the color pyramid, 
including responses to black, white, 
and intermediate greys. 

Less is known about the significance 
of the shading responses than any 
other variable in the Rorschach. 
This is partly because the significance 
of shading was considered relatively 
late in the use of the Rorschach test, 
and partly because it is the one re- 
sponse which Rorschach himself did 
not fully describe. The evidence, 
such as we have, is based on clinical 
observation and quite a large amount 
of it is purely speculative. There is 
a paucity of experimental data in all 
phases of Rorschach development. 
Rorschach himself, and most of his 


1The data contained in this article were 
gathered at Michigan State College as part of a 
doctoral thesis. The author wishes to express 
his gratitude to Dr. S. Howard Bartley for his 
help and guidance in directing this study. 


followers, have freely admitted this 
lack. 

There is an especial lack of informa- 
tion concerning the relationship be- 
tween type of response and the vari- 
ables in the stimulus ink blot which 
provoked it. Gibson (3) states that 
the Rorschach method “. . . presup- 
poses a non-relationship between the 
stimulus pattern and the perception. 

.’ However, as Gibson goes on to 
point out, such a relationship does 
exist. The present study is an at- 
tempt to shed some light on the nature 
of this relationship through varying 
one aspect of the stimulus ink blot, 
the degree of shading contrast. 

Of the many stimulus variables 
present in an ink blot, the shading 
variable was chosen for study because 
of the importance which clinicians at- 
tach to its use as indicators of malad- 
justment. Beck (1) states that the 
use of shading to give vista effects 
indicates feeling of inferiority and de- 
pression while the use of shading to 
give hazy, diffuse effects, as clouds or 
smoke, indicates passive reaction to 
threat. Klopfer and Kelley (5) inter- 
pret the use of diffusion responses (K) 
as indicating feelings of insecurity and 
anxiety while the use of shading to 
give surface color to an object (C’) 
represents feelings of depression. 

It was thought that a study of the 
shading contrast variable in the stim- 
ulus ink blot might (a) throw further 
light on the general problem of to what 
extent the ink blot influences the re- 
sponse of S; (b) more specifically, 
indicate to what extent the amount 
and kind of shading responses given 
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by S are functions of this variable, and 
finally, (c) by isolating a factor which 
may influence the production of par- 
ticular kinds of shading responses, 
point the way to the development of 
ink blots specifically designed to elicit 
these responses, much as Levy (7) has 
developed plates specifically designed 
to elicit human movement (M) re- 
sponses. Such ink blots might have 
value in experimental designs where 
the shading response is the primary 
variable under consideration. 


MeETHOD 


Subjects —The population for this experiment 
consisted of 75 students at Michigan State Col- 
lege. These Ss were divided into three groups 
of 25 each, equated for age and sex. There were 
18 women and 7 men in each group, and the mean 
ages of the groups were 20.4, 19.8, and 20.1 yr. 

All Ss were volunteers. A majority of the 
psychology majors used in the experiment were 
enrolled at the time in the beginning experimen- 
tal psychology course at Michigan State College. 
The non-psychology major students came from a 
wide variety of fields. 

Previous test experience with the Rorschach 
or projective techniques disqualified a student 
as an S for this experiment. 

Apparatus and test materials ——A couch with 
reclining back was provided for S. The stimu- 
lus cards were held in a fixed position 60 cm. 
from S’s eyes by a suitable stand. A 60-w. 
lamp, 120 cm. from the card, provided illumina- 
tion. 


TABLE 1 


REFLECTANCE oF Controt Greys FoLLow1ncG 
PuotocrRAPHic REPRODUCTION 














Per Cent Reflectance (Incandescent 
Light) 
Number 
of Grey 
Azo paper Azo paper Azo paper 

No. 0 0.2 No. 4 

1 45.79 45.57 50.02 

2 43.64 44.11 48.73 

3 42.79 43.61 47.31 

4 40.38 42.28 41.08 

5 36.85 35.96 31.66 

6 31.19 27.84 21.42 

7 25.28 23.76 9.39 

8 18.14 11.03 1.58 

9 14.50 6.92 1.14 

10 11.91 4.01 1.04 























TABLE 2 
EXPERIMENTAL Desicn 
Shading Contrast 
Group 
Weak Medium Strong 
I B* S A 
II A B [ 
Ill Cc A B 











* Refers to batches of seven cards. 


The first seven cards of the Rorschach (6), 
Behn-Rorschach (8), and Harrower (4) series of 
ink blots were used as the stimuli. These 21 ink 
blots were photographed on Kodak Commercial 
film, a film which seemed to give the best rendi- 
tion of the red coloring on Cards II and III in 
each series. Each of the resulting negatives 
was then reproduced by the contact process on 
Kodak Azo papers No. 0, 2, and 4, making a total 
of 63 ink blots, 21 of light shading contrast, 21 
of medium contrast, and 21 of strong shading 
contrast. From data supplied by the manu- 
facturer (2) it appeared that Azo papers No. 0, 
2, and 4 would yield approximately equal inter- 
vals of shading contrast. 

At the time the 21 ink blots were photo- 
graphed, a control figure consisting of a series of 
greys was also photographed, and treated by the 
same photographic process. This control nega- 
tive was also reproduced on Kodak Azo papers 
No. 0, 2, and 4. All the cards and the control 
greys were photographed, developed, and 
printed at the same time under similar condi- 
tions. 

The reflectance of the control greys was 
then determined by the use of the Macbeth 
illuminometer. The results appear in Table 1. 

Procedure—Each of the 21 ink blots was 
assigned to one of three batches of seven cards. 
The three groups of Ss were shown the stimulus 
cards in a modified Latin square design as shown 
in Table 2. An S in Group I, for example, saw 
a card which appeared in Batch A under strong 
conditions of shading. All Ss saw all of the 21 
ink blot forms, seven with weak shading contrast, 
seven with medium shading contrast, and seven 
with strong shading contrast. The order of 
presentation of the stimuli was random. 

Since the presence of shading in ink blots is 
thought by Rorschach workers to be emotionally 
disturbing to some Ss, continuous records of 
circulatory and respiratory changes were made. 
However, since no changes in physiological re- 
sponses were found with respect to the degree of 
shading contrast, the present report will be 
confined to the verbal productions of Ss. 
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The S was given the following instructions: 
“People see all sorts of things in these ink blot 
pictures, now tell me what you see, what it 
might be for you, what it makes you think 
of.” 

The first card was placed in the card holder 
and all responses were recorded verbatim by E. 
When S indicated that he was through, the card 
was removed, the next card placed in the holder, 
and the procedure repeated. If S was not 
through at the end of 90 sec., he was allowed to 
finish the response he was giving after which the 
card was exchanged. This procedure continued 
until all 21 ink blots had been seen. 

In general, scoring criteria according to 
Klopfer and Kelley (5, pp. 119-141) were used 
to determine into which shading category a re- 
sponse fell. Since there was no inquiry, only 
the performance proper or “free association” re- 
sponses could be utilized in evaluating the type 
of shading used. 


1. Depth responses (FK) were scored when 
the concept was a landscape, scene, valley, moun- 
tain, cave, etc. Reflections were not scored FK 
unless the reflected object itself utilized shading, 
ie., “a lake shore with surrounding trees and 
brush reflected in the water” was scored FK, 
while “a person looking at his reflection in a 
mirror” was not. 

2. Diffuse responses (K) were scored when the 
object seen was a cloud, fog, haze, or smoke. 

3. Surface color (C’) was scored when the 
object seen was described as being black, white, 
or intermediate shades of grey or where the ob- 
ject was described as being striped. 

4. Toned-down depth (k) was scored when 
the object seen was an X-ray or a topographical 
map. 

5. Surface texture responses (c) were re- 
sponses where the seen object was described as 
being furry, rough, grainy, hairy, pebbly, etc., 
or where highlights were specifically mentioned. 

6. The number of times shading was used as 
a determinant on each card was found by adding 
the five shading categories, above, for each card. 


The responses were scored without regard to 
form (whether predominant or subordinate) and 
no distinction was made between main and addi- 
tional responses. Concept organization was 
not considered. 


RESULTS 


The effect of degree of shading con- 
trast on the number of times shading 
was used in responding to the ink 


* Standard Rorschach instructions according 
to Klopfer and Kelley (5, p. 32). 
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blots was determined by an analysis 
of variance (Table 3). This analysis 
indicates that the number of times 
shading was used did not vary sig- 
nificantly with the degree of shading 
contrast. 

Table 4, which summarizes the find- 
ings with respect to the various kinds 
of shading used, indicates that the 
number of times shading was used as 
surface texture (c) did not vary sig- 
nificantly with the shading contrast. 
The shading response categories, dif- 
fuse effects (K) and toned-down depth 
effects (k), likewise showed no signifi- 
cant increases or decreases with re- 
spect to this variable. 

However, as the degree of shading 
contrast in the ink blots was increased, 
there was a significant increase in the 
number of times shading was used as 
achromatic color (C’). The total 
number of C’ responses given under 
weak conditions of shading contrast 
was 47, under medium conditions of 
contrast it was 60, and under strong 
conditions it was 70. These differ- 
ences are significant at the 5% level 
of confidence. 

The number of responses utilizing 
shading as depth (FK) also varied sig- 
nificantly with the degree of shading 
contrast. Under weak conditions of 
shading, the total number of FK re- 
sponses was 26, under medium condi- 


TABLE 3 


Errect or Decree or SHADING 
CoNTRAST ON THE PropucTion 
or Responses Usinc SHADING 








oe se df ss F 
Groups 2 34.032 3.06 
Shading 2 24.413 2.19 
Batches 2 103.841 9.32* 
Cards 18 2016.984 20.12* 
Error 38 211.570 

Total 62 2390.840 














* Significant at the 1% level of confidence. 
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TABLE 4 


ANALYsIs oF VARIANCE OF Types or RESPONSES 














Toned-Down 
Source of Surface Texture Achromatic Color Depth Diffuse Effects Depth Effects 
Variance | df 
ss F ss F ss F ss F ss F 
Groups 2 1.365 | <1 9.238 | 2.83 10.889 | 3.05 6.889 | 3.72* | 3.937] 4.07* 
Shading 2 2.317} <1 6.333 | 3.89* | 11.555 | 3.24* 793) <1 127} <1 
Batches 2 42.317 | 3.82* 3.523] 1.08 413) <1 508} <1 2.112} 4.36* 
Cards 18 | 943.079 | 9.45**| 440.381 | 15.02**| 198.317 | 6.17**| 91.651 | 5.51**| 12.889] 1.48 
Error 38} 210.668 61.906 67.810 35.143 18.380 
Total 62 | 1199.746 527.714 288.984 134.984 39.556 






































* Significant at the 5% level of confidence. 
** Significant at the 1% level of confidence. 


tions it was 50, and under strong con- 
ditions it was 40. This finding is sig- 
nificant at the 5% level of confidence. 


Discussion 


One finding of the present study is 
that increasing the degree of shading 
contrast results in an increase in the 
number of responses given which uti- 
lize the shading to give surface color 
to the thing seen (C’). A possible 
explanation for this increase may be 
found in an examination of what hap- 
pens to the ink blot when the degree 
of shading contrast is increased photo- 
graphically. When this is done, the 
number and amount of intermediate 
greys which appear in the ink blot are 
reduced; the darker greys tend to- 
ward the black end of the scale and 
the lighter greys tend toward white. 
Thus an ink blot reproduced under 
strong contrast conditions abounds in 
black and white whereas the same ink 
blot reproduced under weak contrast 
conditions contains hardly any black 
or white and abounds in intermediate 
greys. Black and white as surface 
colors may lend themselves to more 
objects in nature than do the inter- 
mediate greys, resulting in an increase 
in the number of responses utilizing 
surface color when more black or 
white is present. 


Another explanation might be that 
the increased contrast between the 
black and white itself serves to attract 
the attention of S to the achromatic 
color, resulting in an increased use of 
the achromatic color in his response. 

As a result of varying the degree of 
shading contrast in the ink blots there 
are also differences in the number of 
responses given utilizing the shading 
as depth. The difference here does 


* not seem to be proportional to the 


degree of shading contrast in the 
cards; the ink blot with the weakest 
shading contrast produced fewest 
depth responses but the medium 
shaded cards produced more depth 
responses than did the cards with the 
strongest degree of shading contrast. 
The only significant difference is be- 
tween the weak shading card and the 
medium shaded card. 

An examination of the three cards 
which produced the most responses 
utilizing the shading to give depth to 
the response prompts three possible 
explanations. The process of increas- 
ing the contrast tends to lighten the 
greys which appear at the edges of the 
blots, causing them to blend with the 
white surround. The resulting hazi- 
ness of outline gives an aerial per- 
spective to the blot forms, resulting 
in more seen depth as the contrast is 
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increased. Secondly, the increased 
number of depth responses may be 
attributed to the secondary cue over- 
lay. Some of the ink blots seem to 
have layers superposed one on an- 
other. With increased contrast be- 
tween the black and white, the over- 
laying layers may have become more 
apparenttoS. Thirdly, the increased 
contrast may enhance the depth pro- 
ducing attached shadow effect of some 
parts of the blots. 


SUMMARY 


Seventy-five Ss were shown ink 
blots reproduced under weak, medium, 
and strong conditions of shading con- 
trast. Responses utilizing the shad- 
ing to give surface texture (c), diffusion 
effects (K), and toned-down depth 
effects (k) did not vary with contrast, 
nor were there changes in the number 
of shading responses produced. How- 
ever, it was found that there was an 
increased tendency to produce depth 
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responses (FK) and to utilize the 
shading as surface color (C’) when the 
shading contrast was increased. 


(Manuscript received April 24, 1951) 
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RATE OF MANIPULATIVE LEARNING AS A FUNCTION 
OF GOAL-SETTING TECHNIQUES ! 
GERALD C. HELMSTADTER AND DOUGLAS S&S. ELLIS 
Towa State College 


Attempts to maximize the motiva- 
tion of individuals performing a task 
frequently involve the use of both 
knowledge of results and a goal against 
which S’s performance is compared. 
Although knowledge of results is a 
motivator of demonstrated signifi- 
cance (6), few systematic studies have 
investigated the combined motiva- 
tional effects of knowledge of results 
and goal-setting manipulations. 


The studies of Mace (5) and Bayton (1) are 
typical. Mace found that a moving standard 
based on previous performance was more effec- 
tive in a computational task than simple in- 
structions to “do your best.” Both of these 
techniques were superior to a fixed goal set by E. 
The least effective technique was simple in- 
structions to surpass previous performance. 

Working with a simple manipulative task, 
Bayton found that four motivational techniques 
could be ranked from most effective to least 
effective as follows: (1) S sets own goal and tells 
E what it is; (2) S sets own goal but does not 
tell E what it is; (3) S is given knowledge of re- 
sults with no other comment; (4) S is told 
nothing concerning his previous performance. 
Statistical analysis revealed that the only signifi- 
cant differences were between Methods 1 and 
4 and between Methods 2 and 4. 


These two studies suggest at least 
three parameters that may define the 
influence of motivational techniques 
which combine knowledge of results 
with a standard or goal. Mace’s 
finding that a moving standard was 
more effective than a fixed one indi- 
cates that the relationship between 
the standard and S’s performance is 
important for externally set goals. 
Bayton’s results suggest that, for self- 
set goals, the degree of S’s ego involve- 
ment in his goal should be considered. 


1 Paper presented at the 1951 meeting of the 
Iowa Academy of Science. 


Finally, whether the goal is self-set 
(Bayton’s most effective manipula- 
tion) or externally-set (Mace’s most 
effective manipulation) may be a 
critical dimension of goal-setting tech- 
niques. 

The present study explores this last 
dimension by utilizing the following 
four motivational conditions: 


I. Knowledge of results. 

II. Self-set: S sets own goal on the basis of 
past performance? 

III. Externally-set norm: E provides a goal 
based on an average performance curve. 

IV. Externally-set improvement: E£ pro- 
vides a goal based on S’s previous per- 
formance and increments on an average 
performance curve. 


Conditions II and IV are, respec- 
tively, the most effective techniques 
used by Bayton and Mace. Their 
comparison should provide evidence 
of the importance of who sets the goal. 
Condition III, a technique commonly 
used in practical situations, serves as 
an additional externally-set goal con- 
dition for comparison with the self-set 
goal of Condition II. Condition I is 
included as a control to determine if 
any of the goal-setting techniques used 
provide motivation beyond that as- 
sociated with simple knowledge of 
results. 


METHOD 


Subjects and experimental design—A total of 
100 volunteers from undergraduate psychology 
courses at Iowa State College served as Ss. 
Except for the first five Ss who were placed in 
Group I, they were assigned to the four groups 
corresponding to the four motivational conditions 
so that the groups were fairly well matched on 


? This method of goal setting is equivalent to 
the manipulation followed in level of aspiration 
experiments (7). 
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initial score. The sex distribution for the groups 
was as follows: Groups I, III, and IV: 12 females, 
13 males; Group II: 11 females, 14 males. 
Task.—The task used was a modified form of 
the block-turning portion of the Minnesota Rate 
of Manipulation Test. The S was required to 
work continuously through the middle two rows 
of blocks, and was given standardized instruc- 
tions and a demonstration of the work methods 
to be used. Following an untimed practice 
trial, ten 60-sec. trials were administered with a 
60-sec. rest interpolated between trials. The 
number of blocks turned per 60-sec. trial was 
taken as the measure of performance. 
Goal-setting procedures.—During rest periods, 
knowledge of results was given to all Ss by show- 
ing S a consecutive plot of his previous perform- 
ance scores. The Ss in the knowledge of results 
group (Group I) received no additional treat- 
ment. The Ss in the self-set goal group (Group 
II) were asked to state an expected score for 
their next trial, and this aspiration score was 
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also plotted on the graph. A complete average 
performance or norm curve against which S could 
compare his performance was shown to members 
of the externally-set norm goal group (Group 
III) after each trial. For members of the ex- 
ternally-set improvement goal group (Group IV), 
the goal set for S on each trial was determined by 
adding to his previous performance the incre- 
ment on the norm curve between the same two 
trials.? 


RESULTS 


The learning curves of the four 
groups are presented in Fig. 1. The 
apparent similarity of the curves is 


%The norm curve was an estimate based on 
the results of previous investigations (3,4) as 
well as the scores of the first five Ss tested in the 
present experiment. Evidence of the reason- 
ableness of the various goals used is presented in 
Fig. 2. 
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Fic. 1. Block-turning performance as a function of motivational techniques. Group I—knowl- 
edge of results; Group II—self-set goal; Group [[I—externally-set norm goal; Group [V—externally- 
set improvement goal. 
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Fic. 2. Goals associated with the three goal-setting techniques and 
performance of the knowledge of results group (Group I). 


supported by the analysis of variance 
arrayed in-Table 1,‘ which indicates 
that none of the goal-setting tech- 
niques differed significantly in per- 
formance improvement (Trial 10 
minus Trial 1) during the course of 
the experiment. The significant sex 
difference indicates that the males 
showed greater improvement than the 
females.5 


‘This analysis was based on Snedecor’s 
method for disproportionate subclass numbers 
(8, pp. 289-290). Appropriate statistical tests 
(2, p. 11; 8, p. 285) indicated that the data met 
the assumptions necessary for such an analysis. 

5 Although the use of difference scores mini- 
mizes differences in initial ability of the groups, 
an analysis of variance of initial scores showed no 
statistically significant difference between the 
groups or sexes in respect to initial ability. The 
fact that the sexes differed in improvement but 


Figure 2 compares the average per- 
formance of the knowledge of results 
group (Group I) with the goals associ- 


TABLE 1 


ANALyYsIS OF VARIANCE OF 
IMPROVEMENT Scores 











Source of Variation df ss F 
Goal-setting techniques 
(GST) 3 84.55] .41 
Sex 1| 275.61 | 4.01* 
Sex X GST 3} 103.04] .50 
Error 92 | 6320.91 
Total 99 | 6784.11 














* Significant at the 5% level of confidence. The 
5% and 1% values of F are, respectively, 3.95 and 6.92. 
not in initial ability might be due to such factors 
as (a) differences in motivation, (b) differences 
in amount of pre-experimental practice on simi- 
lar tasks, (c) sampling errors, etc. 
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ated with the three goal-setting techni- 
ques used. It can be seen that the 
goals, whether self-set or externally- 
set, are generally close to the actual 
performance of the knowledge of 
results group. 


Discussion 


Since performance was used as an 
index of motivation in the present ex- 
periment, the results clearly imply 
that none of the goal-setting tech- 
niques used provided motivation be- 
yond that associated with simple 
knowledge of results. Of particular 
interest is the fact that self-set goals 
were no more effective than extern- 
ally-set goals of comparable magni- 
tude. 

It is, of course, possible that the ex- 
perimental situation used may not 
have been favorable for the differen- 
tial operation of externally-set and 
self-set goals. The use of knowledge 
of results with college student Ss may 
provide such high motivation that the 
effects of additional motivational ma- 
nipulations are negligible. The sim- 
plicity of the task and the use of dis- 
tributed practice may have operated 
to reduce the level of motivation suf- 
ficient for maximum performance, 
which would also tend to decrease 
differences between experimental 
conditions. Finally, the closeness of 
the externally-set and self-set goals 
to actual performance and to each 
other should be mentioned. It is 
possible that any differential effect of 
externally-set and self-set goals would 
depend on the goals being set quite a 
distance above actual performance. 

These considerations suggest the 
desirability of examining the following 
variables in further investigations of 
the relative influence of externally-set 
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and self-set goals: (a) initial motiva- 
tion of Ss; (b) task complexity; (c) 
inhibitory characteristics of the con- 
ditions under which the task is 
performed, and (d) relationship be- 
tween goals and “normal” perform- 
ance. 


SUMMARY 


An experiment was conducted to in- 
vestigate the effects of several goal- 
setting techniques on the performance 
of-a simple block-turning task. The 
investigation utilized a total of 100 Ss, 
distributed among four groups corre- 
sponding to the following motivational 
conditions: (a) knowledge of results; 
(b) self-set goal; (c) externally-set 
norm goal; (d) externally-set im- 
provement goal. 

The obtained results indicated that 
none of the three goal-setting tech- 
niques provided greater motivation 
than simple knowledge of results. 
Several variables which may limit the 
generality of this conclusion were 
specified. 


(Manuscript received May 4, 1951) 
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REACTIVE INHIBITION AS A FACTOR IN 


MAZE LEARNING: III. EFFECTS IN 
THE HUMAN STYLUS MAZE? 


MERRELL E. THOMPSON 


University of Arkansas 


This study is the third in a series of 
investigations of the role of reactive 
inhibition in maze learning. In the 
first study (7), Hull’s assumption that 
reactive inhibition is an increasing 
function of work was confirmed in a 
maze situation. In the second study 
(8) the role of reactive inhibition was 
demonstrated in studies of place learn- 
ing versus response learning. The 
purpose of the present study is to ex- 
tend the application of the principle 
of reactive inhibition to encompass 
the aspect of maze behavior formerly 
attributed to centrifugal swing. 

Hull (5) has assumed that each re- 
action of an organism builds up a state 
of negative motivation within the or- 
ganism which tends to prevent the 
repetition of that response for a short 
time. This state of negative motiva- 
tion is temporary in character, most 
of it showing spontaneous dissipation 
after a period of time. This means 
that when an animal makes a choice 
in a maze (e.g., the right alley) and 
receives an appropriate reward, two 
things occur: (a) the act of turning 
right is strengthened, and (b) inhibi- 
tion against turning right again for 
some time is developed. If the next 
trial follows immediately, the animal 
will tend to alternate his response, 
that is, turn left at the choice point. 


This is somewhat similar to the maze situation 
in which the animal makes a forced turn preced- 
ing the choice point and then tends to choose the 


1 This paper is a modified version of a report 
presented by the writer at the 1950 meeting of 
the APA. The writer wishes to express his 
gratitude to Dr. R. H. Burros for his advice on 
the statistical treatment of the data. 


alley in the opposite direction. This latter phe- 
nomenon has been formerly interpreted in terms 
of centrifugal swing which was originally derived 
from Schneirla’s studies on learning and orienta- 
tion in ants (6) and denotes a relationship be- 
tween turns at a choice point and the immedi- 
ately preceding pattern of movement forced by 
the spatial arrangement of the maze alleys. 
Later, Ballachey and Krechevsky (2) and Bal- 
lachey and Buel (1) demonstrated the operation 
of centrifugal swing in the choice-point behavior 
of rats. Still more recently, Bitterman and 
Bretz (3) have applied the concept to human 
maze learning. The basic assumption of this 
hypothesis is that an organism tends to be forced 
against the wall away from the direction of a 


forced turn, thereby facilitating a choice in the . 


direction opposite to that of the preceding forced 
turn. Thus, we would expect a preponderance 
of left turns at a choice point which was just 
preceded by a forced right turn. 

While the experimental evidence on this 
point leaves little to be desired, the explanation 
of the phenomenon, from a behavioral point of 
view, is far from adequate. The chief weakness 
of this hypothesis is that there is no way of in- 
corporating it into a broader systematic theory 
of learning. The hypothesis of centrifugal 
swing may provide an adequate vehicle for the 
explanation of choice-point behavior following a 
forced turn but it adds nothing to our under- 
standing of other maze behavior. 

The principle of reactive inhibition, on the 
other hand, has the advantage of explaining a 
much wider range of phenomena. Hull has suc- 
cessfully employed it in his explanations of many 
of the phenomena of conditioning and mention 
has been made of two previous attempts to ex- 
tend the principle to encompass certain aspects 
of maze behavior. The present study is an 
attempt to show that the principle of reactive 
inhibition may more adequately cover maze 
behavior formerly attributed to centrifugal 
swing. 


The prediction of choice-point be- 
havior in all the maze patterns char- 
acterized by one preceding forced 
turn would be the same whether one 
held to the hypothesis of centrifugal 
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swing or to that of reactive inhibition. 
However, since centrifugal swing, due 
to its physical character, cannot ac- 
cumulate (it is entirely dependent 
upon the just preceding turn) it is 
possible to arrange a maze situation 
which critically tests the two assump- 
tions. 

If we have two or more forced con- 
secutive like turns preceding the 
choice point, we will expect, if centri- 
fugal swing is the determining factor, 
no greater proportion of opposing 
turns at the choice point than in a 
maze where there is only one preced- 
ing forced turn. However, with the 
principle of reactive inhibition, which 
assumes that the inhibition is a cum- 
ulative affair and that two like turns 
will develop more inhibition than one 
turn, we will expect more opposing 
turns at the choice point when it is 
preceded by two forced like turns 
than by only one forced turn. 


EXPERIMENTAL PROCEDURE 


Subjects—The Ss were 120 college students 
from the general and experimental psychology 
courses at Kent State University. There were 
79 males and 41 females assigned at random to 
the various groups. 

A pparatus.—The present investigation uti- 
lized single-choice, grooved-type stylus mazes. 
In Fig. 1 are illustrated the three mazes, each in 
four positions, used in the study. It is to be 
noted that Maze 1 consisted of a choice point 
preceded by one clockwise turn, Maze 2 of a 
choice point preceded by two clockwise turns, 
and Maze 3 of a choice point preceded by three 
clockwise turns. 

All three mazes were identical with respect to 
width of alleys, length of path to choice point, 
and length of alleys from choice point to the 
goals. They differed in only one respect, the 
number of clockwise turns preceding the point 
of choice. The alleys were } in. deep and } in. 
wide, the distance from starting point to choice 
point was 8 in., and the length of each alley from 
choice point to goal was 1 in. Each maze was 
fastened to a board 9 in. square. 

Procedure-—The Ss were assigned at random 
to one of the 12 maze patterns (three mazes in 
four positions) and given one trial on that pat- 
tern. The maze was placed in a grooved slot on 
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MAZES 


Fic. 1. The three mazes, each in four 
positions, used in the study 


a table behind a black cloth screen containing a 
narrow slit through which S’s arm was permitted 
to pass. When S arrived for the experimental 
session the following directions were read by E: 

“We want to determine how Ss will respond to 
different mazes. There are no correct or incor- 
rect turns. I will place the stylus which you 
hold at the starting point; then say ‘begin.’ 
Start pushing the stylus down the maze alley. 
Do not hesitate at the turns but proceed until 
I say ‘goal.’ You are to hold the stylus in 
your preferred hand as you hold a pencil, like 
this (demonstrate). Keep your hand and arm 
off the table and maze. Are there any ques- 
tions? If not, we will proceed.” 


REsuLTs AND Discussion 


The design of the experiment per- 
mits an analysis in terms of the num- 
ber of counterclockwise choices as a 
function of the position of the respec- 
tive mazes and as a function of the 
number of preceding forced clockwise 
turns. Ordinarily this factorial de- 
sign would permit a straightforward 
analysis of variance treatment. In 
the present study, however, such a 
treatment is not practical since each 
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S had only one trial which yields a 
single score of either one or zero. 
For this reason the chi-square test 
was used. 

Since we were primarily concerned 
with testing the null hypothesis (that 
Ss’ choices of the counterclockwise 
alley at the choice point in the three 
mazes were by chance), the first step 
was to determine whether or not the 
frequencies of choices for the four 
positions used in each maze could be 
pooled. This was essentially a test of 
technique as described by Edwards 
(4, pp. 108-113). 

The results of this analysis are 
shown in Table 1. Since none of the 
chi-squares is significant, the varia- 
tions within the maze position groups 
for the three mazes may be assumed to 
be no greater than expected in random 
sampling from a common population. 
This finding allows us to pool the data 
from the four subgroups for each 
maze and then to apply the chi-square 
test to the pooled frequencies. The 
hypothesis to be tested is that the 
three maze groups have been ran- 
domly drawn from a common popu- 
lation. The pooled frequencies of 
counterclockwise turns for Mazes 1, 
2, and 3 were 16, 32, and 34, respec- 
tively. Since the obtained value of 
x? (22.54) is highly significant for 2 
df the null hypothesis tested must be 
regarded as untenable. 


TABLE 1 


NuMBER OF COUNTERCLOCKWISE CHOICES 
(N = 10 each group) 

















Position Maze 1 Maze 2 Maze 3 
A 7 6 7 
B 4 9 8 
Cc 3 9 10 
D 2 8 9 
x? 5.83 3.75 3.92 
? 12 29 27 
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The results of the present study 
clearly demonstrate that the choice- 
point behavior of human Ss is influ- 
enced by the number of preceding 
forced turns in the types of mazes 
employed. The effect of increasing 
the number of forced clockwise turns 
resulted in a significantly greater num- 
ber of counterclockwise turns at the 
point of choice. In the present study 
the mechanics of movement through 
the alleys will not account for the re- 
lationship since the tendency for the 
counterclockwise choices increased as 
the number of preceding clockwise 
turns was increased. 

If centrifugal swing is the predom- 
inant factor in choice-point behavior 
in these mazes we would expect the 
number of counterclockwise choices 
to be the same in all three choice- 
point situations regardless of the num- 
ber of preceding clockwise turns. Un- 
fortunately this aspect has been over- 
looked in the many attempts to 
explain choice-point behavior follow- 
ing forced turns in terms of centrifugal 
swing. The data from studies with 
rats (9) offered in support of this hy- 
pothesis have in fact shown the same 
relationship between the number of 
forced like turns preceding the choice 
point and choice-point behavior, al- 
though the implications of this rela- 
tionship have not been noted. 

In conclusion, we can say that, for 
human Ss, the number of counter- 
clockwise choices will increase sig- 
nificantly as the number of forced 
clockwise turns is increased, the num- 
ber of counterclockwise choices being 
some increasing function of the num- 
ber of preceding clockwise turns. 
And we may add, in the interests of 
an integrated and systematic science, 
that since the principle of reactive 
inhibition can be applied to a broader 
range of behavioral phenomena, its 
use in explaining the effect of preced- 














REACTIVE INHIBITION IN LEARNING 133 


ing forced turns upon choice-point be- 
havior is preferred over the use of the 
concept of centrifugal swing, limited 
as it is. 

SUMMARY 

1. In order to determine which of 
two hypotheses, centrifugal swing or 
reactive inhibition, is more adequate 
in explaining choice-point behavior in 
maze patterns characterized by one or 
more forced like turns preceding the 
choice, 120 college students were given 
one trial each on one of 12 maze 
patterns. There were three single- 
choice, grooved-type stylus mazes 
(each in four positions) differing only 
in the number of forced clockwise 
turns preceding the choice point. 

2. The over-all effect of increasing 
the number of forced clockwise turns 
is an increasing tendency for counter- 
clockwise alleys to be chosen at che 
point of choice as shown by a chi- 
square of 22.54 with 2 df. 

3. It is suggested that the principle 
of reactive inhibition provides a more 
adequate explanation of this aspect of 
maze behavior than does the hypothe- 
sis of centrifugal swing. 


(Manuscript received May 10, 1951) 
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LATENT LEARNING IN A WATER MAZE 


A. W. BENDIG! 
University of Pittsburgh 


In recent reports on studies of la- 
tent learning two aspects of the prob- 
lem have been ascribed an increasing 
degree of importance: (a) the level of 
motivation of the Ss during the pe- 
riod when latent learning could be 
expected to occur, and (b) the ability 
of the Ss to discriminate the drive 
states used. Munn (7) points out 
that experiments involving weak mo- 
tivation tend to demonstrate latent 
learning, whereas strong motivation 
tends to discourage such learning. 
The results of Meehl and MacCor- 
quodale (5) confirm that rats strongly 
motivated tend not to show cognitive 
types of learning, but animals weakly 
motivated do. Such results suggest 
that Tolman’s concept of the role of 
strongly motivating conditions in 
“narrowing the cognitive map” (9) 
may be applicable here. 

As to motivational discriminability, 
Tolman (10) suggests that one of the 
important types of learning is that of 
“drive discrimination,” and that these 
discriminations are particularly im- 
portant in latent learning experi- 
ments. Heron’s work on the ability 
of rats to discriminate different drives 
or different levels of the same drive 
(2) is relevant to this point. In dis- 
criminating between alternating states 
of hunger and thirst with one of the 
pair of drives satiated the animal may 
not be learning the difference between 
hunger and thirst drive stimuli, but 


1 These experiments were performed in the 
Laboratory of Neurophysiology, Western Psy- 
chiatric Institute and Clinic, and were supported 
by a research grant from the U. S. Public Health 
Service. Appreciation is expressed to Dr. 
Robert A. Patton for his advice and supervision 
during this research. 


simply the presence or absence of one 
of the sets of stimuli. Since these 
two drives are notoriously difficult to 
separate experimentally (2), the pres- 
ent research is concerned with using 
a different combination of basic drives: 
food deprivation and escape from 
water. 
EXPERIMENT [| 

Method 


Subjects—Twenty-three male rats of the 
Sprague-Dawley strain were used. The Ss 
ranged in age from 130 to 150 days at the begin- 
ning of their latent learning training and were 
experimentally naive. 

A pparatus.—The water-maze previously used 
in this laboratory (8) was modified to provide a 
T-maze pattern similar to that used by Kendler 
(4). The diagram of the maze is given in Fig. 1. 
The alleys were filled with water to a depth of 84 
in. and the water was maintained at a tempera- 
ture of 60° F. Sliding retrace doors were placed 
at points 4 and B in Fig. 1. Metal landing 
platforms were placed at X and Y and were 
separated by a solid metal partition impossible 
for the Ss to scale. 

Procedure.—Each S was given four trials per 
day during three days of training in the maze. 
On the first and third trials each day the Ss were 
placed in the maze at point S in Fig. 1, facing 
away from the choice point, and allowed to 
choose a right or left turn at the choice point. 
After reaching landing platform X or Y, S was 
allowed to remain for 30 sec. before being re- 
moved for the next trial. On the second and 
fourth trials each day S was forced to swim to 
the landing platform opposite to its previous 
choice. Forcing was done by closing the sliding 
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Fic. 1. Maze diagram 
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retrace doors at 4 or B. Thus, if S chose to 
turn to the right on his first trial and swim to Y, 
on his second trial he found door B closed and 
was forced to swim to X. 

During the three training days, food (Purina 
Laboratory Chow) and water were always avail- 
able in the Ss’ home cages. In the maze, food 
was always found by the rats on one of the land- 
ing platforms and never on the other platform. 
Half of the Ss found food always at X and never 
at Y, and the reverse conditions obtained for the 
other half. The landing platform containing 
food had Purina dog pellets piled so that the Ss 
had to climb over and sit on the food during 
trials to that side. During the three training 
days, when the Ss always had food available in 
the home cages, none was observed to eat the 
food during the training trials. 

After the last trial on the third training day 
the Ss were returned to the home cages from 
which all food had been removed. Twenty- 
three hours later they were given a single trial in 
the maze under the combined escape from water 
and (for the first time) hunger drives. The Ss 
were scored on this single trial as to whether they 
turned toward the side where they had always 
previously found food on the landing platform 
(food-side response) or toward the side where 
they had never found food (non-food-side re- 
sponse). 


Results 


On the single trial on the fourth day 
of the experiment, 14 of 23 Ss turned 
toward the side where they had al- 
ways previously found food and 9 Ss 
made non-food-side response. Test- 
ing this distribution against a chance 
or equal probability hypothesis gives 
a chi-square value of 1.087 which with 
1 df is not significant at the 5% level. 
The conclusion was drawn that, while 
a majority of the Ss performed as pre- 
dicted by a latent learning hypothesis, 
the results could be parsimoniously 
attributed to chance. 

Observations of Ss during the train- 
ing and testing days suggested that 
one complicating factor was the strong 
emotional reaction of Ss to the water- 
maze situation. Such strong emo- 
tional response may have tended to 
interfere with cognitive learning of the 
maze or with the operation of the 


hunger drive on the fourth day. Two 
additional experimental groups were 
therefore run to determine the effi- 
ciency of certain changes in the experi- 
mental procedure. 


EXPERIMENT II 
Method 


Subjects —The same 23 Sprague-Dawley rats 
of the previous study were utilized again. The 
animals were 185 to 210 days old at the begin- 
ning of this study. 

Apparatus.—The same maze was used as in 
the previous study. 

Procedure-—The’ procedure was similar to 
that used in the previous study, but with certain 
changes. These changes were directed toward 
reducing the strength of emotional motivation 
during the training days of the experiment. (a) 
The training trials were preceded by five days of 
adaptation in a water-filled straightaway as 
used by Russell (8). Four trials per day were 
given and on each trial the Ss were required to 
swim a 48-in. straightaway to a landing platform. 
Adaptation to the swimming situation appeared 
to take place over the 20 trials in the straight- 
away. (b) The Ss were given four days of maze 
training while satiated for food instead of three 
days as in the previous study. (c) The Ss were 
placed in the maze at point S in Fig. 1 facing 
toward the choice point instead of facing away 
from the choice point. It was noted in the previ- 
ous study that the Ss in making a 180° turn to 
approach the choice point tended to acquire a 
clockwise or counterclockwise turning habit. 
Such a habit may have tended to influence their 
response at the choice point. (d) Sliding re- 
trace doors were placed at points C and D in 
Fig. 1 rather than at 4 and B. This tended to 
eliminate useless responses at the choice point. 
(e) The water temperature was kept at 77° F. 
instead of the previously used colder tempera- 
ture. Using Wever’s data (11) this should have 
tended to lower the strength of the escape-from- 
water drive. In all other respects the procedure 
followed was that used in Exp. I. 


Results 


On the single hunger-motivated 
trial on the fifth day, 17 of 23 Ss 
turned toward the food-side and 6 
toward the non-food-side. Testing 
these proportions against a chance 
hypothesis gave a chi-square value of 
5.260, which is significant at the 5% 
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level. These results suggested that 
the changes made in the training pro- 
cedure increased the amount of latent 
learning shown by the Ss. To con- 
firm this, an independent replication 
of the above experiment was per- 
formed as part of a study reported 
elsewhere (1). 


EXPERIMENT III 
Method 


Subjects —The Ss were 23 male albino rats 
of the strain maintained by Albino Farms. 
They were 116 to 128 days old at the beginning 
of the experiment. 

Apparatus.—The same maze was used as in 
Exp. I and II. 

Procedure.—The same procedure was used as 
in Exp. II with the single exception that all Ss 
were given four trials in the maze on the fifth 
day, the conditions of each of the four trials 
being the same as the single trial on Day 5 in 
Exp. II. Only the first trial on that day is 
relevant here. 


Results 


Eighteen of the 23 Ss turned toward 
the food-side on the first trial of the 
hunger-motivated fifth day. The chi- 
square value is 7.348, which is signi- 
ficant at the 1% level. 


Discussion 


The results suggest that the phe- 
nomenon of “latent learning” was 
present in Exp. II and III, if not in 
Exp. I. The Ss learned the location 
of food in the water maze when the 
food itself was not a reward object 
or (more precisely) was not a relevant 
reward for the dominant drive. How- 
ever, the question can be raised as to 
whether the latent learning shown on 
the fifth day was actually “latent” 
during the training period. Meehl 
and MacCorquodale (5) note the ap- 
pearance of an increasing preference 
for the food-side during the training 
trials. From the free-choice trials of 
the four training days (Trials 1, 3, 5, 
7, 9, 11, 13, and 15) the percentage 
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of free responses directed each day 
toward the food-side was computed 
for the separate groups of Exp. II and 
III and is found in Fig. 2. It is evi- 
dent that a food-side preference was 
set up in the Ss during the training 
days. Thesomewhat different shapes 
of the curves of acquisition for Exp. 
II and III probably reflect sampling 
fluctuations, but may reflect strain 
differences. Whether this learning 
of the location of food and of showing 
a preference for the location when not 
experimentally motivated for the food 
is a function of Hull’s “secondary rein- 
forcement” (3), Miller’s “acquired 
drives” (6), or Tolman’s “equivalence 
beliefs” (10) cannot be answered by 
these experiments. However, some 
construct over and above that of pri- 
mary reinforcement is a necessity in 
explaining latent learning. 

It should be noted that latent learn- 
ing was more evident in the groups of 
Ss that had been habituated to the 
maze and whose escape-from-water 
drive had been somewhat reduced by 
raising the temperature of the water. 
This is in accord with the results of 
Meehl and MacCorquodale (5) who 


found latent learning only when weak 
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drives were present during the train- 
ing sequence. It also agrees with 
Tolman’s view of the effect of strong 
motivation (especially strong emo- 
tional motivation) on a narrowing of 
the cognitive map (9). In these more 
complex types of learning, strong 
motivation appears to have a dis- 
organizing effect and the optimum 
drive must be weak for such learning 
to take place effectively. 


SUMMARY AND CONCLUSIONS 


1. The phenomenon of latent learn- 
ing was demonstrated in a water 
maze. Escape-from-water and food- 
deprivation were used as motivating 
conditions in three experiments with 
rats, the training trials involving 
escape-from-water with food present 
and the Ss food-satiated, the test trial 
involving hunger motivation. 

2. The amount of latent learning 
was a function of the amount of pre- 
training received by the Ss, and the 
influence of the pretraining was to 
reduce the unorganized drive in the 
Ss. 

3. The latent learning is not “la- 
tent,” but appears in performance 
during the training period. 


(Manuscript received May 14, 1951) 
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STUDIES IN SHORT-DURATION AUDITORY FATIGUE: 


II. RECOVERY TIME 


ANITA I. RAWNSLEY AND J. DONALD HARRIS 
U. S. Naval Medical Research Laboratory 


The use of short duration stimula- 
tion is a very convenient approach to 
the many problems raised in the area 
of auditory fatigue. Some of the ad- 
vantages of this approach and a brief 
history have been given in an earlier 
paper (5). 

One of the major problems raised by 
partial stimulation deafness is the na- 
ture of the recovery process after 
stimulation. Goran de Maré (2, 3) 
first recognized that after even rather 
mild stimulation (at intensities previ- 
ously thought to cause no post-stimu- 
lation threshold shift) there was a 
definite, though very brief, time before 
complete recovery. He found that 
after stimulating the ear with a 500- 
cps tone at a sensation level of 88 db, 
fatigue declined from 14.9 db at no 
delay to 6.16 db after delay of .7 sec. 
De Maré argued that this effect was 
neither a decline of sensation nor a 
persistence of vibration in the sound 
conduction apparatus. Bronstein 
and Churilova (1) used a stimulating 
tone of 2-min. duration at 94-db 
loudness level, and found that the re- 
covery process was relatively delayed 
for the higher frequencies. Liischer 
and Zwislocki (6, 7, 8) systematically 
attacked the problem of recovery from 
fatigue caused by short duration 
stimulation. The stimulating tone 
was always .4sec.in duration. Under 
one specific’ fatiguing condition, 
namely, with 3000 cps at 80-db inten- 
sity, fatigue decreased in an exponen- 
tial manner from 39 db after 20 msec. 
to zero db after 200 msec. The ab- 
solute amount of fatigue depends upon 
several factors, such as the frequency 


and intensity of the stimulating tone, 
but for the present purpose we especi- 
ally note Lischer and Zwislocki’s 
conclusions that the recovery from 
fatigue is similar at all frequencies, 
following very nearly an exponential 
course irrespective of the initial ab- 
solute value of the fatigue. 

The present experiment is an exten- 
sion of Liischer and Zwislocki’s work, 
designed to explore more thoroughly 
the relations among fatigue, interval 
after stimulation, intensity of stimula- 
tion, and frequency. 


APPARATUS 


It was desired to present to S a pair of tones, 
each controllable independently as to frequency 
and intensity. The first of the pair was to be 
the stimulating tone, the second was to be a test 
or probe tone, the response to which would estab- 
lish the momentary auditory threshold a short 
while following the offset of the stimulating tone. 

Two Western Electric 6B audiometers were 
connected through a mixing network to a pair 
of Permoflux PDR-8 phones, fitted with supra- 
aural cushions, one for EF andoneforS. A three- 
channel predetermined electronic timer made by 
Potter Instrument Co. was used to control the 
duration of output of each audiometer, and of 
the interval between outputs. The audiometer 
dials themselves, in 2.5-db steps, controlled the 
frequency and intensity of the two tones. The 
electronic timer was placed outside the sound- 
proof room and operated by a microswitch in 
E’s hand. The frequency of the audiometers 
was set with reference to a Hewlet-Packard 
direct-reading frequency meter, accurate with- 
in +1 per cent. 


SuBJECTs AND ProcEepuRE 


Four normal-hearing Ss were used at the fre- 
quencies 500, 3000, and 8000 cps. Duration of 
the first or fatiguing tone was always 400 msec., 
of the second or probe tone always 50 msec. The 
fatiguing tone was either 10-, 30-, 50-, or 70-db 
sensation level. The interval of silence between 
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tones, during which recovery proceeded, varied 
from 20 to 300 msec. The frequency chosen for 
any experimental session was set on one audiom- 
eter with the frequency meter, and the other 
adjusted to it by the method of aural beats. 

Three thresholds were determined in each 
experimental session, all by the method of con- 
stant stimuli (five judgments at 2.5-db intensity 
steps in random order): (a) for the fatiguing 
tone alone; (b) for the probe tone alone; and 
(c) for the probe tone preceded by the fatiguing 
tone. 

The threshold shift between (b) and (c) was 
taken as the amount of fatigue. Five fatigue 
determinations were averaged for each combina- 
tion of frequency, intensity, and recovery inter- 
val. 


REsuLTs AND Discussion 


Figure 1 shows the rise in fatigue 
with increasing intensity, for seven 
recovery periods. With the shortest 
recovery period of 20 msec., a fatigue 
effect can be found even though the 
stimulating tone is only about 5 db 
above threshold. Notice also that as 
the intensity of the fatiguing tone is 
increased, each curve in Fig. 1 ex- 
hibits a double slope. Nothing of the 
sort occurs in the data of Liischer and 
Zwislocki (7, p. 433), but an even 
more perplexing shape appears in the 
data of Gardner (4, p. 184), and of 
Munson and Gardner (9, p. 180-181), 


where what these workers term a 
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“saturation effect” occurs, such that 
as the fatiguing tone is raised from 
about 40- or 50- to about 80-db sensa- 
tion level, relatively little additional 
fatigue is occasioned. The data of 
Munson and Gardner were gathered 
at recovery periods of about .1 and .2 
sec., and they state that “tests at 
shorter delay times indicate that 
saturation becomes less noticeable as 
this factor decreases until at zero 
time (i.e., for simultaneous masking) 
it is difficult to detect or entirely 
absent” (9, p. 180). However, it is 
seen in Fig. 1, and the same holds true 
for the other two frequencies, that the 
double-slope feature is maintained at 
all recovery periods used. 

It is clear that data such as in Fig. 1 
may be recast in another form to show 
the course of decline of fatigue. This 
is done for the three frequencies in 
Fig. 2,3,and4. The figures are inter- 
preted as follows: at 500 cps, when the 
first tone is of 70-db sensation level, 
fatigue has reached a level of about 
59 db after 20 msec., has declined to 
about 36 db after 120 msec., and has 
reached a zero value at 360 msec. 
The differences between the sets of 
data in Fig. 2, 3, and 4 may in part be 
attributed to the differential effect of 
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frequency at constant sensation level, 
as described in our earlier paper, and 
in part perhaps to the status of the 
particular ear at the several fre- 
quencies. 

The exact slope of the decline of 
fatigue is a matter of considerable 
interest. In general, it appears to 
follow a regular decline in decibels 
with time. 

In order to check the universality 
of this point, and to get an idea of 
individual differences, the decline of 
fatigue for a 50-db sensation-level 
tone of 2000 cps was determined for 
eight Ss. The average fatigue at each 
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recovery interval, and the standard 
error, are shown in Fig. 5. It is clear 
that the best fitting curve is a straight 
line for this particular sensation level, 
and that there are relatively slight 
individual differences in the course of 
the disappearance of fatigue. These 
eight Ss all had essentially normal 
hearing; it remains to be determined 
whether[the decline of fatigue takes 
this same form in cases with various 
types of deafness. There are, how- 
ever, systematic deviations from this 
straight-line recovery, especially at 
the lower intensities. At 30-db sensa- 
tion level, for example, Fig. 2-4 show 
that for all frequencies the recovery 
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proceeds with negative acceleration 
even when fatigue is plotted on a log 
scale. Of course, even when as at 
higher intensities the course of re- 
covery appears as a straight line on a 
log scale, it must be understood that 
if fatigue were to be plotted in absolute 
terms rather than on a log scale, the 
recovery process would still assume 
sharp negative acceleration. 

What, then, is the nature of this 
fatigue? It is not a function of the 
middle ear; in our laboratory we have 
subjected a patient with a radical os- 
silectomy to the same conditions as of 
Fig. 5, and find that the course of re- 
covery does not differ from the normal. 
Experiments on cross-conduction have 
ruled out a central locus. Evidently 
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on auditory fatigue 


the site is within the cochlea. Rosen- 
blith (10) and Rosenzweig and Rosen- 
blith (11) have shown recovery curves 
of the nerve response to clicks in the 
animal, both from the round window 
and the cortex. These curves bear 
some correspondences to the recovery 
data of this paper; but it is not clear 
that the fatigue process is of nervous 
origin. A decline in gross nerve re- 
sponse would occur as well if the site 
of the fatigue were, say, in the organ 
of Corti, or in the transmission mech- 
anism between the organ of Corti and 
the first-order neuron. 

Further information about the fa- 
tigue discussed here can be obtained 
from Fig. 6, which considers the effect 
of duration of the fatiguing tone. It 
is seén that the usual time-intensity 
relationship is here violated. Evi- 
dently the ear, though suffering from 
a process which reduces its threshold, 
is able to make an adaptation up to a 
point so that at moderate intensities 
no further fatigue will result from in- 
definitely prolonged stimulation. At 
somewhat higher levels the adaptation 
can be maintained though not for as 


long, and, of course, at extremely high 
intensities damage may result. But 
for intensities short of this extreme, 
fatigue must be a reversible, and in 
some cases noncumulative, adapta- 
tion, probably of the peripheral organ. 


SUMMARY AND CONCLUSIONS 


The technique of studying auditory 
fatigue using brief, weak tones has 
proved very fruitful, since the ear’s 
return to normal is so quick that a 
great deal of information can be 
gathered in a short time. This paper 
studies the course of the recovery 
from such fatigue as a function of 
stimulation frequency and intensity, 
and of recovery interval. 

To a pure tone of 500, 3000, or 8000 
cps up to 70-db sensation level, acting 
for .4 sec., recovery is complete within 
little more than a third of a second. 
For weaker stimuli recovery is pro- 
portionately quicker. At about 50-db 
sensation level, the disappearance of 
fatigue follows roughly an exponential 
course; at weaker levels a negative 
acceleration appears. Experiments 
with cross-conduction and on a pa- 
tient with ossicles missing rule out the 
central nervous system and the middle 
ear respectively as the locus of fatigue. 
An experiment on the effect of dura- 
tion of stimulation shows that the ear 
within at least 100 msec. or less ex- 
hibits a residual threshold shift due to 
stimulation, but that at moderate in- 
tensities no greater threshold shifts 
may be occasioned with fatiguing 
stimulus durations of at least 10 sec. 
and probably indefinitely. Whether 
this continuously adapting mechanism 
is within the peripheral organ, at the 
transmission between the sense cells 
and the first nerve fibers, or is purely 
neural, is not yet known. 


(Manuscript received May 21, 1951) 
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STUDIES OF DISTRIBUTED PRACTICE: V. LEARNING 
AND RETENTION OF CONCEPTS 


LEONARD OSEAS AND BENTON J. UNDERWOOD 
Northwestern University 


Previous studies in this series have 
examined the influence of short inter- 
trial rests on learning adjectives pre- 
sented serially (7, 8), as paired associ- 
ates (6), and in verbal-discrimination 
lists (9). In only serial learning was 
it shown that distributed trials pro- 
duced faster learning than did massed 
trials. Theoretically, the results have 
been negative in that we have found 
no substantial support for any con- 
temporary theories of verbal learning. 

The present study was designed to 
extend our empirical generalizations 
concerning the influence of distributed 
practice. Serial, paired-associate, 
and verbal-discrimination lists em- 
phasize what is commonly called rote 
learning in that the discovery of rela- 
tionships is minimized and the fixa- 
tion process maximized. Concept 
learning, on the other hand, requires 
that S discover relationships among 
stimuli before the response can be 
fixated. So far as we have been able 
to determine there are no published 
studies in which the relative influence 
of massed and distributed practice 
has been evaluated for concept learn- 
ing. Experiments varying the time 
interval between trials in solving 
puzzles (2, 3) have shown that massing 
tends to be more efficient than dis- 
tribution. The solution of these puz- 
zles, like the learning of concepts, re- 
quires the discovery of relationships. 
However, it is questionable as to 
whether or not any generalization is 
possible from the puzzle-solving re- 
sults since the intertrial interval has 
been inadequately explored. Massed 
practice, in the above-cited experi- 


ments, consisted of a few seconds be- 
tween trials, while distributed prac- 
tice in no case involved intervals less 
than 24 hr. between trials. In short, 
intervals which have shown facili- 
tation in rote learning (a few seconds 
to 2 min.) have not been systemati- 
cally investigated for tasks requiring 
the apprehension of relationships. 

In the present study simple con- 
cepts have been learned under four 
different conditions of distribution, 
namely, 6, 15, 30, and 60 sec. between 
each trial. 


PROCEDURE 


Materials.—Two sets of materials were used, 
one forming a practice list and one an experi- 
mental list. The stimuli for the experimental 
list consisted of three geometric forms: triangles, 
squares, and circles. Each form appeared in 
three sizes: } in., }in., and 2in.in height. Here- 
after these different sizes will be referred to as 
small, medium, and large. Finally, each form 
and size appeared in three shades (black, gray, 
white). Since all combinations of the three di- 
mensions (form, size, shade) appeared, a total 
of 27 different stimuli was used. Of the three 
dimensions, only size and shade were relevant to 
the concepts to be learned; form was the irrele- 
vant or distracting characteristic. All concepts, 
therefore, were determined by combinations of 
size and shade, and since all combinations of size 
and shade were used, one for each concept, there 
was a total of nine concepts to be learned. For 
example, large black square, large black circle, 
and large black triangle were all illustrations of 
a single concept (large black form). Thus the 
total of 27 different stimuli represented three 
illustrations of each of the nine concepts. 

As concept indicators (responses to be made 
by the Ss) single consonant letters were used. 
These were assigned so that obvious associations 
between the concept and the letter were not 
present. The concepts, and the letters assigned 
each, were as follows: large black form—R;; large 
gray form—H; large white form—N; medium 
black form—J; medium gray form—Q; medium 
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white form—X; small black form—Z; small 
gray form—P; small white form—Y. 

The stimulus figures were drawn in ink on 
white vellum tape; the concept indicators were 
typedonthetape. The 27 stimuli were repeated 
twice on the tape so that 54 figures appeared. 
These were divided into six groups of nine each, 
each group being separated from the next group 
by a strip of masking tape. The order of ap- 
pearance of an illustration of a given concept 
varied for each of the groups, but each concept 
was illustrated once and only once in each group 
of nine figures. Furthermore, the same example 
of a given concept never appeared in two suc- 
cessive groups. These arrangements made it 
highly unlikely that the concepts could be 
learned by rote. 

The practice-list stimuli also varied in three 

dimensions. These stimuli consisted of a 
straight inked line with small dots contiguous. 
The direction of the line varied three ways 
(vertical, horizontal, oblique); the position of the 
dots varied three ways (above the line, at the 
center of the line, below the line); and the num- 
ber of dots varied from one to three. Number of 
dots was the irrelevant dimension, the direction 
of the line and the position of the dots being 
relevant. In other details of construction the 
practice list was the same as the experimental 
list. None of the letters used as concept indi- 
cators was the same for the two lists. 
- Lists were presented by means of a Hull-type 
memory drum at a 3:3-sec. rate, i.e., 3 sec. for 
the stimulus and 3 sec. for the stimulus and re- 
sponse together. Learning was by the anticipa- 
tion method. 

Subjects and conditions—Four conditions 
comprised the experiment, being distinguished 
only by differences in the time lapse between each 
trial. A trial was counted as the presentation 
of one group of nine stimuli. Four groups of 
Ss, each group made up of 18 college students, 
were used. Ss were assigned at random to the 
four groups. In Group I, 6 sec. elapsed between 
each trial; this will be called the massed condi- 
tion. For Group II the intertrial interval was 
15 sec., for Group III, 30 sec., and for Group IV, 
60 sec. 

The Ss were instructed carefully on the prac- 
tice day concerning the nature of a concept and 
differences between rote learning and concept 
learning were emphasized. Following these in- 
structions the practice list was presented with S 
instructed to anticipate after the first trial. The 
list was presented by massed practice (6 sec. 
between trials) until four of the nine concepts 
were correctly anticipated on a single trial. 
From this point on learning took place by dis- 
tributed practice (15 sec. between trials) until a 
perfect recitation was achieved. During the 
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15-sec. interval between trials, S named colors 
from a color board made up of rows of small 
patches of different colors. These were called 
out at the rate of 72 per min. as paced by a 
metronome. This same color-naming proce- 
dure was used by Groups II, III, and IV on the 
experimental day. After attaining the criterion 
on the practice list, S rested 10 min. and then 
recalled and relearned under massed conditions. 

The experimental session followed the prac- 
tice session by not more than three days and 
not less than one day. On the experimental 
day the experimental list was presented under 
the conditions specified above for each group 
until two successive perfect trials were attained. 
Each S returned to the laboratory 24 hr. later 
and relearned the experimental list by massed 
practice. 


RESULTS 


Practice-day performance.—The 
mean number of trials required to 
learn the practice list of concepts by 
Groups I through IV was 13.50, 
12.83, 12.39, and 12.94, with oy’s of 
1.21, 1.53, .91, and .91, respectively. 
The within-groups variance is much 
larger than the between-groups vari- 
ance (F = .16).! It may be con- 
cluded that the four groups, consti- 
tuted of randomly assigned groups, 
did not differ significantly on practice- 
day performance. 

The original plan of the experiment 
called for matching the four groups on 
practice performance if random as- 
signment did not result in comparable 
groups. Such a procedure obviously 
assumes that performance on the 
practice and experimental lists will be 
highly correlated. Computations of 
product-moment correlations have 
shown that this is an invalid assump- 
tion. The r’s ranged from .39 for 
Group I to .08 for Group III. The 
value for all 72 Ss combined was .25. 
While we may expect the correlations 


With 3 and 68 df, an F of approximately 
2.74 is needed for the 5% level of confidence, and 
4.08 for the 1% level. On this and all subse- 
quent analyses Bartlett’s test for homogeneity of 
variance was made and was in no case found to 
be significant. 
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TABLE 1 


LEARNING AND RETENTION ScoRES FOR 
THE EXPERIMENTAL CONCEPTS 
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6 sec. | 25.44) 1.83) 29.89] 1.61] 7.00 | .38| 2.83 | .32 
15 sec. | 21.17) 1.50) 27.00) 2.33) 7.44 | .31| 2.78 | .49 
30 sec. | 20.67} 1.82) 23.56) 1.85} 6.67 | .26] 3.83 | .46 
60 sec. | 22.78) 2.23) 26.94) 2.76) 7.00 | .34| 2.33 | .24 
































to be lowered somewhat because of 
different conditions on the two days, 
nevertheless, a correlation of .39 for 
Group I (where conditions were the 
same) is unsatisfactory as a basis for 
matching. We must conclude that 
the performances on these two tasks 
do not have high communality. Be- 
cause of this, the evaluation of the 
results for the experimental day must 


rest largely on the assumption that 
the random assignment of Ss resulted 
in groups whose performance would 
not have differed significantly had 
they learned the experimental list 
under identical conditions. 

Learning and retention of experi- 
mental list—Data on learning and 
retention of the experimental list for 
the four conditions are shown in 
Table 1. 

The learning scores show that on 
both criteria the distributed-practice 
conditions produced slightly faster 
learning than the massed condition 
(6 sec.). However, tests of signifi- 
cance for these criteria give no basis 
for rejecting the null hypothesis. 
For the criterion of one perfect trial 
F is 1.28, and for two perfect, 1.41. 
Nevertheless, the learning curves (Fig. 
1) show small but consistent differ- 
ences between the massed condition 
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and all three distributed conditions 
throughout the major course of learn- 
ing. That these three curves of dis- 
tributed practice would all fall below 
the curve of massed learning is highly 
unlikely on a chance basis if no true 
difference existed. It is our conclu- 
sion that distributed practice has pro- 
duced a slight facilitation in the learn- 
ing of simple concepts. 

Cook (2) has suggested that in 
tasks requiring the discovery of rela- 
tionships massing should facilitate 
early in learning and distribution late 
in learning. The learning curves for 
the acquisition of the present list of 
concepts give no support to this hy- 
pothesis. 

The scores for recall and retention 
show no effect of variation in inter- 
trial interval during learning. None 
of the differences attains significance. 
For recall F is .95, for relearning, 2.65. 

Order of attainment.—An analysis 
was made of the order of attainment 
of the nine concepts. ‘To do this we 
determined the number of trials re- 
quired to learn each concept by each 
S to a criterion of two successive 
perfect responses. Since the results 
did not vary appreciably as a func- 
tion of conditions, the data for all 72 
Ss have been combined. To deter- 
mine if shade was a source of variance, 
we calculated the mean number of 
trials to attain the three white (out- 
line) concepts, the three gray con- 
cepts, and the three black concepts. 
Iu these determinations size is con- 
stant since each of the three sizes oc- 
curs equally often with each shade. 
The mean number of trials to attain 
the white concepts was 14.23, the 
gray, 14.31, and the black, 14.55. 
These values obviously do not differ 
significantly. 

Size, on the other hand, is related 
to rate of learning. Holding shade 
constant, the mean number of trials 





LEONARD OSEAS AND BENTON J. UNDERWOOD 


to learn the concepts based on all 
small figures was 13.74; the medium- 
size concepts required 16.64 trials, 
and the large, 12.71. These three 
distributions could not have come 
from the same population (F = 
20.51). As shown by t-tests, differ- 
ences between the means for the small 
and medium, and between the large 
and medium sizes are highly signifi- 
cant. The t-values are 5.29 and 3.90 
respectively. The difference in rate 
of learning the small and large con- 
cepts is not significant (¢ = 1.39). 

These results suggest the possibil- 
ity that differences in rate of attain- 
ment might be attributed to interfer- 
ence produced by stimulus generaliza- 
tion. In learning concepts of medium 
size, generalizing tendencies (of maxi- 
mal strength for the present experi- 
ment) from both the small and large 
figures could interfere. But, in learn- 
ing the concepts based on small figures, 
generalizing tendencies would be max- 
imal from the medium-size figures, 
but minimal as regards the large be- 
cause of their greater “distance” along 
the size dimension. Likewise, in 
learning the concepts based on large 
forms, interference again should be 
maximal from the medium-size figures 
but minimal from the small. 

Tests of the generalization hypothe- 
sis are available in the error data. If 
generalization is an adequate ex- 
planatory concept, the following facts 
must be present: (a) more errors must 
occur in learning the medium con- 
cepts than in learning either the large 
or small; (b) errors in learning the 
concepts based on small figures must 
show that more of the errors were 
“medium errors” than “large errors,” 
i.e., responses which would have been 
appropriate for a medium-size concept 
must occur with greater frequency 
than responses which would have been 
appropriate for a large figure; and (c) 
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errors in learning the concepts based 
on large figures should show that more 
of the errors were medium errors than 
small errors. 

In testing these expectations we 
counted all errors for all Ss for all 
conditions. All three expectations 
are confirmed. In learning the large 
concepts, 1084 errors were made; in 
learning the medium concepts, 1440, 
and in learning the small concepts, 
1186 errors occurred. In learning the 
large concepts, 406 errors were re- 
sponses appropriate to the medium- 
size figures, and 206 errors were ap- 
propriate to the small. In learning 
the small concepts, 467 responses ap- 
propriate to the medium concepts 
were made and only 179 which were 
appropriate to the large. These find- 
ings strongly support the generaliza- 
tion hypothesis. 


Discussion 


The two major findings of the pres- 
ent study are: (a) conditions of dis- 
tributed practice produce consistently 
faster learning than does the condition 
of massing; (b) concepts based on 
large and small figures are acquired 
more rapidly than those based on 
medium-size figures. Brief comments 
on the second finding will be given 
first. 

Analysis of the errors made in learn- 
ing strongly support the position that 
differences in order of attainment of 
concepts as a function of size are pro- 
duced by interferences resulting from 
stimulus generalization. This con- 
firms Buss’s results (1). Buss dem- 
onstrated a gradient of stimulus gen- 
eralization along a height dimension 
in concept learning. It seems clear 
that generalization is useful as an ex- 
planatory concept for all levels of 
learning, from conditioning through 
concept formation. 


The results which show that dis- 
tributed practice produced somewhat 
faster learning of concepts than did 
massed practice are completely com- 
patible with previous findings of rote 
learning. The learning of serial ad- 
jectives of low intra-list similarity 
(7, 8), and the learning of paired- 
associate nonsense syllables (4) show 
the same small but consistent differ- 
ences in favor of distribution as found 
in the present study. Distribution 
produces a more marked effect in the 
learning of serial adjective lists of 
high intra-list similarity (8), and no 
effect on paired adjective (6) and 
short verbal-discrimination lists (9). 

The fact that concept learning re- 
sponds to distributed practice in 
much the same way as rote learning 
does not necessarily mean, of course, 
that the same basic processes are re- 
quired in both forms of learning. On 
the other hand, descriptive differences 
between rote learning and concept 
formation (as pointed out in the in- 
troduction) do not irrevocably point 
to fundamentally different learning 
processes. If further tests show that 
concept learning and rote learning re- 
spond in much the same way to ma- 
nipulable variables, the position that 
we are dealing with the same basic 
learning processes becomes more ac- 
ceptable. Such an outcome would 
be highly desirable from the stand- 
point of parsimony in theory con- 
struction. The previous studies in 
this series have shown that present 
theories are not entirely adequate to 
account for the effects of distributed 
practice. The study reported here 
simply extends the range of empirical 
facts which will need to be encom- 
passed by a comprehensive theory. 

A final comment about the reten- 
tion results is needed. The present 
data showed that distributed practice 
during learning had no influence on 
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recall or relearning of the concepts 
after 24hr. However, it is our opinion 
that the present study was an inade- 
quate test of the possible relationship 
because retention under all conditions 
was quite high. This finding is in 
line with Reed’s results (5) and it 
recommends that retention intervals 
longer than 24 hr. must be used to 
determine the influence of distributed 
practice on the retention of concepts. 


SUMMARY 


To extend the empirical generaliza- 
tions concerning the influence of dis- 
tributed practice, simple concepts 
were learned with four different inter- 
trial rests; namely, 6, 15, 30, and 60 
sec. ‘The stimuli were three different 
geometrical forms of three sizes and 
three shades. Size and shade were 
relevant dimensions, form irrelevant. 
Nine concepts were learned, each 
being a different combination of size 
and shade./ Single letters served as 
concept indicators. Four groups of 
18 Ss each were used, a different group 
for each of the above four conditions. 
Retention was measured 24 hr. after 
learning. The results show: 

1. Distributed conditions (15, 30, 
or 60 sec. between each trial) pro- 
duced faster learning than did the 
massed condition (6 sec.). The dif- 
ferences were small but consistent 
throughout the course of learning. 

2. Retention did not vary as a 
function of intertrial interval during 
learning, being very high under all 
conditions. 

3. Concepts based on small and 
large geometrical forms were learned 
more rapidly than those based on 
medium-size figures. An analysis of 
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of the errors made during learning 
showed this finding to be expected on 
the basis of stimulus generalization. 
The results produced by distributed 
practice in concept learning were 
shown to be consistent with findings 
of the influence of this variable on 
rote learning. The descriptive dif- 
ferences between rote learning and 
concept learning may not reflect 
fundamentally different processes. 


(Manuscript received May 31, 1951) 


REFERENCES 


—_ 


. Buss, A. H. A study of concept formation 
as a function of reinforcement and stimu- 
lus generalization. J. exp. Pesychol., 
1950, 40, 494-503. 

2. Coox, T. W. Massed and distributed prac- 

tice in puzzle solving. Psychol. Rev., 
1934, 41, 330-355. 

3. Ertcxsen, S. C. Variability of attack in 
massed and distributed practice. J. exp. 
Psychol., 1942, 31, 339-345. 

4. Hovianp, C. I. Experimental studies in 
rote-learning theory: VIII. Distributed 
practice of paired associates with varying 
rates of presentation. J. exp. Psychol., 
1949, 39, 714-718. 

5. Reep, H. B. Factors influencing the learn- 
ing and retention of concepts: I. The 
influence of set. J. exp. Psychol., 1946, 
36, 71-87. 

6. Unperwoop, B. J. Studies of distributed 
practice: II. Learning and retention of 
paired-adjective lists with two levels of 
intra-list similarity. J. exp. Psychol., 
1951, 42, 153-161. 

7. Unperwoop, B. J. Studies of distributed 
practice: III. The influence of stage of 
practice in serial learning. J. exp. 
Psychol., 1951, 42, 291-295. 

8. UnpErwoon, B. J., & Goap, D. Studies of 
distributed practice: I. The influence of 
intra-list similarity in serial learning. 
J. exp. Psychol., 1951, 42, 125-134. 

9. UnpERwoop, B. J., & ViTerNna, R. O. 

Studies of distributed practice: IV. The 

effect of similarity and rate of presenta- 

tion in verbal-discrimination® learning. 

J. exp. Psychol., 1951, 42, 296-299. 

















SENSITIVITY TO APPARENT MOVEMENT IN DEPTH AS 


A FUNCTION OF STIMULUS DIMENSIONALITY ! 


WILLIAM M. SMITH 
Princeton University 


Continuous change in angular size 
of an object for which the usual dis- 
tance-cues are not available, such as a 
disc seen in the dark, is ordinarily per- 
ceived not as an alteration in the size 
of the object but as a movement of the 
object in the third dimension. This 
phenomenon of apparent movement in 
depth has received very little atten- 
tion in the experimental psychology of 
visual perception, as several recent 
writers have pointed out (3, 5, 7). 
As compared with the extensive litera- 
ture on the phi-phenomenon, for ex- 
ample, studies of the phenomenon of 
apparent movement in the third di- 
mension are few and far between. In 
addition, very little is known about 
this particular kind of movement 
regarding sensitivity functions. 

One study of illusory movement in 
depth (8) showed that sensitivity to 
the onset of movement did not vary 
significantly for objects which in 
everyday life are frequently seen in 
flight or in some kind of movement. 
A baseball, for example, was not re- 
sponded to any quicker with respect 
to onset of movement than were mean- 
ingless objects such as a disc. There 
were a few indications, however, that 
sensitivity in this respect might vary 
along a continuum of tridimensional- 
ity. Test objects with an obvious 
three-dimensional appearance might 
elicit faster reaction times than ob- 
jects which appeared flat. 

Such a possibility follows naturally 
enough from a modern extension and 


1 This study was supported by a grant to the 
Department of Psychology, Princeton University, 
from the Rockefeller Foundation. 
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application of the Helmholtzian no- 
tion of unbewusster Schluss according 
to which past experience, with its 
wealth of associations about objects 
of everyday use, plays a crucial role 
in perception. For example, it should 
follow from such a view that since the 
everyday environment presents many 
three-dimensional objects in move- 
ment, the sensitivity to the movement 
of such objects should be greater than 
for objects which are flat in appear- 
ance. The main purpose of the pres- 
ent experiment was to determine if 
this difference in sensitivity would be 
found. The secondary purpose of the 
study was to determine the effect of 
monocular and binocular vision on 
sensitivity to this kind of apparent 
movement. Since it has been as- 
sumed by most investigators of this 
phenomenon that this kind of illusory 
movement is maximal under monocu- 
lar vision and somewhat equivocal 
for binocular vision, it was pertinent 
to examine this assumption (5). 


APPARATUS AND STIMULUS 
MATERIALS 


Since the apparatus used in the present study 
was the same as that described in a similar ex- 
periment (8) the interested reader is referred 
there for a detailed description and diagram. 
The apparatus consisted of a long tablelike 
structure with a cubicle at one end from which S 
made observations. A superstructure on this 
table carried a small screen upon which was pro- 
jected the desired test object from a Clason 
projector mounted on top of the cubicle. The 
projector was gear-driven by a variable-speed 
transmission set at a constant speed. This ar- 
rangement gave the desired size-change of the 
projected image. By means of a front-surface 
mirror and an evaporated metal film, the image 
on the screen was reflected to S’s eye(s) with the 
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apparent location of the test object directly 
ahead. The use of an adjustable occluding 
device controlled monocular and binocular view- 
ing. Placed inside the cubicle to the right of S 
was a telegraph key which, when depressed, 
stopped the transmission, turned off the pro- 
jector, and stopped the electric timer which 
measured S’s reaction time. 

The stimulus material consisted of 35-mm. 
slides, one slide for each of three test objects, 
white on black. One figure was a Necker cube, 
another was a figure of the same size and outline 
but with internal lines radiating from the center 
in a spokelike fashion. ‘The width of the lines of 
this figure was the same as the width of the lines 
making up the cube. The third figure, a solid 
white irregular hexagon, was of the same size 
and outline as the other two figures. All test 
figures were of the same size and shape but 
differed in their internal makeup in order to 
produce differences in apparent tridimensional- 
ity. The Necker cube appeared three dimen- 
sional, the others appeared flat, especially the 
spokelike figure. Prior to carrying out the ex- 
periment, independent judgments of eight indi- 
viduals were taken. All judged the cube to look 
three dimensional, and the others more or less 
flat. The projected size of the test objects when 
initially seen by S, i.e., before a change in size 
was initiated, was 44 mm. (both dimensions). 
This size represented a visual angle of approxi- 
mately 40 min. The rate of size-change for the 
test figures was 1.8mm. persecond. The bright- 
ness of the solid test figure measured by a Mac- 
beth illuminometer was approximately one 
foot-lambert. 


PROCEDURE AND EXPERIMENTAL 
DESIGN 


The design of the experiment required obser- 
vations from six Ss who were all college students 
with 20/20 vision and naive regarding the ap- 
paratus and purpose of the experiment. 

A systematic latin-square design was used. 
The six combinations of the three test objects 
and the two viewing conditions were randomly 
ordered and rotated among the six Ss. Each S 
made four sets of observations, each set consist- 
ing of one observation for each of the six com- 
binations. The last three sets were replications 
of the first set, and only these three sets of data 
were used in the analysis of results. The first 
set was disregarded for reasons of practice. 

When S was brought in for observations, he 
was blindfolded outside of the experimental 
room, and then led to the cubicle where he was 
dark-adapted for 5 min. When properly seated 
and adjusted in the chin- and headrest, he was 
given the following instructions: 
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You are going to be shown different ob- 
jects in front of you. Your task is to watch 
what I show you and at the instant you per- 
ceive it moving toward you, press the key 
under your right hand. Remember, only 
press the key when you definitely see what I 
show you moving toward you, not when you 
think it is moving, but when you see it mov- 
ing. After you press the key, wait until I 
am ready for the next trial. Do not try to 
remove your head from the rest until I tell 
you todoso. Also, never try to move your 
head while making an observation. When I 
am ready, I shall show you what you are to 
judge and then at some later time start it 
moving. You should remember that what 
you see on any one trial will not necessarily 
move. In other words, on some trials 
everything will happen as usual, except 
there will be no movement. Depend only 
on what you actually see. Do you have any 
questions? 


For each trial E would adjust the occluder for 
the required viewing condition and then raise 
the mask placed in front of S’s eyes between ob- 
servations. The E would ask if S saw the test 
object, and at some variable time later E would 
engage the transmission. When S pressed the 
key, the projector light went out and the electric 
timer stopped. The E dropped the mask back 
in place, recorded the reaction time in .0l-sec. 
units, reset the projector and timer and prepared 
for the next trial. The false trials (i.e., no size- 
change) were interspersed at random throughout 
each set of observations. The S was given a 
brief rest between each set. When the obser- 
vations were completed, S was again blindfolded 
and taken outside the experimental room where 
he was asked a number of question pertaining to 
the observations. 


RESULTS 


The results of the experiment con- 
sist of the reaction times to movement 
and supplementary verbal material 
obtained from each S. Table 1 
gives the raw data for each S for each 
test object under each viewing condi- 
tion. The entry in each cell is a mean 
of three judgments expressed in .01- 
sec. units. As pointed out in a previ- 


ous study (8) the analysis of time data 
by statistical techniques requiring a 
normal distribution of data often de- 
mands a readjustment of the data by 
some kind of transformation. A plot 
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TABLE 1 


Mean Reaction Times or Stx Ss ror THREE 
Test Osyects uNDER Monocutar (M) anp 
Brnocutar (B) Viewrnc Conpitions 















































Test Figures 

Ss 
Cube Solid Spoke 
1 M 7.81* 5.45 7.83 
B 4.58 1.91 4.23 
Mean 6.20 3.68 6.03 
2 M 12.87 13.53 11.24 
B 8.15 10.67 10.88 
Mean| 10.51 12.10 11.06 
3 M 4.14 4.56 3.85 
B 3.65 2.94 4.04 
Mean 3.90 3.75 3.94 
4 M 8.36 8.85 9.79 
B 6.47 6.94 7.33 
Mean 7.42 7.90 8.56 
5 M 8.43 14.92 14.78 
B 6.54 7.05 5.34 
Mean 7.48 10.98 10.06 
6 M 10.51 5.96 5.54 
B 2.69 4.47 4.47 
Mean 6.60 5.22 5.00 

















* Each entry in the M and B cells is the mean of 
three measures in .01-sec. units. 


of the present data indicated the need 
of such a transformation and this was 
done by converting to three-place log 
scores. Each reaction time was trans- 
formed and the three scores for each 
S under each of the six combinations 
of conditions were summed to give a 
mean log score. 

Before carrying out the main analy- 
sis, it was necessary to determine 
whether the use of the latin-square 
design imposed any sequence-effects 
on the data. The presence of such 
effects would make it difficult to as- 
sess other effects unless they were 
known and allowance made for them. 


Consequently, an analysis of variance 
was made ona6 X 6 latin square, each 
cell in the square representing the mean 
log time of three observations. The 
results of this analysis gave a mean 
square for sequence of .021 (5 df) and 
a mean square remainder term of 
017 (20 df). The F-ratio for such 
results does not approach statistical 
significance. 

After it was decided that any effect 
due to sequence would not disturb 
subsequent analysis, the transformed 
data were arranged in a triple classi- 
fication table and an analysis of vari- 
ance carried out. The results of this 
analysis are given in Table 2. 

It can be seen from the table that 
none of the first order interactions is 
significant when tested against the 
triple interaction term. Main effects, 
however, when tested against this 
term give F-ratios significant at the 
.001 level for the effects of viewing 
conditions and Ss. The variance due 
to test objects, however, does not ap- 
proach significance. This result indi- 
cates no real differences in reaction 
times among the test objects differing 
in apparent dimensionality. 

The extremely large variance esti- 
mate for the effect of viewing condi- 
tion, which is larger than all other 
sources combined, should be noted. 


TABLE 2 


ANALYsIS OF VARIANCE FOR TRANSFORMED 
Reaction Time Data 








Sum of Variance 
Source Squares af Estimate 





1. Conditions (mono- 417 1 | .417* 
cular and binocular) 


2. Test figures 050 2} .025 
3. Ss 949 5 | .190* 
4. 1 X 2 interaction 021 2 | .010 
5. 1 X 3 interaction 117 5 | .023 
6. 2 X 3 interaction .146 10 | .015 
7. Triple interaction 179 10 | .018 














* Significant at the .001 level. 
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The difference between these two 
conditions consistently (one exception 
out of 18 comparisons) favored bin- 
ocular vision. 

The results given above can be sup- 
plemented somewhat by answers to 
questions asked of each S when he 
finished making observations. No 
attempt will be made to classify these 
answers systematically, but their gen- 
eral trend can be pointed out. Four 
of the six Ss, for example, stated that 
one particular test object seemed to 
move more readily than the others. 
Of these four Ss, three stated a pref- 
erence for the cube, the fourth S 
chose the solid. However, of those 
Ss preferring the cube, their reaction 
times did not agree with their prefer- 
ence. Just as the data showed no 
consistently faster reaction time to 
any particular test object, the post- 
experimental judgments likewise did 
not reveal a consistent preference for 
one particular object. However, all 
Ss agreed that the cube appeared 
three-dimensional whereas the other 
two test objects appeared more or less 
flat. 

With respect to the two viewing 
conditions, three of the six Ss stated 
that they were aware of changes of this 
factor from trial to trial, and that the 
movement was always easier to detect 
under the binocular condition. The 
remaining Ss who stated that they 
were not aware of this factor never- 
theless complained that on some trials 
their vision “didn’t seem quite right,” 
or some statement to that effect. 
These trials were presumably the 
monocular ones. Regardless of 
whether the factor of viewing was 
noted or not, no S was aware of the 
illusory nature of the movement ob- 
served, nor guessed the real purpose 
of the experiment. 
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Discussion 


In perceptual theory, the view that 
factors other than those definable in 
terms of present stimulation affect the 
final percept is not new. These “‘non- 
stimulus” factors are usually referred 
to some central mechanism which 
supplements, corrects, distorts, or 
“overlays” the data given in stimula- 
tion. It is at this point of the per- 
ceptual process that proponents of an 
empiricist view of perception usually 
maintain that past experience, learn- 
ing, etc. manifest their effects. Thus, 
for example, one explanation with re- 
gard to the phenomenon of color con- 
stancy might run as follows. We see 
an orange; such objects have a dis- 
tinctive hue which has been learned. 
When seen under monochromatic il- 
lumination in an otherwise normal 
setting, the learned response in the 
form of a “correction” or memory 
mechanism carries the perception 
despite the contradiction in stimula- 
tion, i.e., the orange is perceived as 
normal in color. The view is that in 
the past we have learned that a par- 
ticular object has particular properties. 
These properties, of course, originally 
emerge from the characteristics of 
stimulation. Once these properties 
are learned, subsequent perception of 
the object brings this learning into 
play. In the extreme case, the 
learned response may contradict the 
data given in present stimulation, and 
still determine the perception. In 
the context of the present experiment, 
the view would be that we have ob- 
served the movement of three-dimen- 
sional objects countless times in the 
past but not objects which appear 
flat. Given equivalent stimulating 
conditions for two such kinds of ob- 
jects, we might expect that the sensi- 
tivity to the onset of movement for 
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an object with an obvious three-di- 
mensional appearance would be 
greater than for an object with a flat 
appearance. 

The present results show, however, 
that the apparent tridimensionality of 
an object has no significant effect on 
the sensitivity to apparent movement 
in depth. These results supplement 
the findings reported in a similar 
study (8) where it was shown that 
movement sensitivity did not vary 
significantly with the “movement 
property” of an object over a variety 
of conditions of observation. It was 
true, however, that reaction times 
were consistently faster under one 
condition of observation for the 
stimulus designated as having the 
“property of movement,” i.e., the 
baseball. Had a variety of conditions 
been investigated in the present study, 
it is possible, too, that some one con- 
dition might have given faster reac- 
tion times to the object appearing 
three-dimensional. 

In a study by Lindemann (6) it was 
found that apparent dimensionality 
of a figure did affect the perception of 
movement, in this case, gamma move- 
ment, a type of apparent movement 
wherein an object appears to change 
in size with a change in illumination. 
Under certain conditions, this change 
is also accompanied by apparent 
movement toward S. This investiga- 
tor reports that when a figure was ap- 
prehended as a thing, it tended to 
remain static and when apprehended 
as a surface, it tended to expand or 
contract. In addition, he reports 
that gamma movement was less for a 
figure which appeared flat. 

These results for gamma movement 
would lead one to expect results dif- 
ferent from those found in the present 
study and opposite to the results pre- 
dicted by an empiricist hypothesis. 


They would suggest that the sensitiv- 
ity to apparent movement in depth 
would be greatest for the test objects 
having a flat appearance. Present 
results, however, show no real dif- 
ference among the test objects and 
hence are in disagreement with the 
gamma movement data and the re- 
sults expected from the empiricist 
point of view. Nevertheless, it is 
possible that differences among test 
objects would have been evident had 
the movement been observed over a 
period of time. Under such condi- 
tions the quality of the movement 
might have varied systematically with 
the apparent dimensionality of the 
test object, as in the case of gamma 
movement. One might hypothesize 
that such a factor as apparent di- 
mensionality does not affect sensitiv- 
ity functions but may affect the 
movement experience once movement 
is detected and observed for a period 
of time. 

With regard to the marked differ- 
ence found between monocular and 
binocular vision, it was not exactly 
expected that a difference, if found, 
would favor the latter. As pointed 
out by Ittelson (5), the movement in 
depth observed by his Ss was quite 
equivocal for the binocular condition, 
yet in the present study the reaction 
times for this condition were consider- 
ably and consistently faster than those 
for the monocular condition. On the 
basis of a perceptual theory of cues 
involving disparity of retinal images 
and allied binocular indications, it 
would be expected that binocular 
vision for this kind of movement 
would favor the perception of flatness 
and alteration in size with no move- 
ment, unless, of course, a considerable 
viewing distance were involved. 
Again the important factor may be 
the difference in detecting initial 
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change and the character of the 
change observed over a period of time. 
It is important to note that Ss in the 
present experiment were set by in- 
struction to observe movement and 
not size-change. Yet by the nature 
of the task, their discrimination was 
of initial change in angular size. 
Thus, had the present Ss observed the 
phenomenon for some time after de- 
tection, the quality of the movement 
might have been much depressed 
under binocular regard, even though 
detected more readily under this 
condition. For real movement in 
depth, sensitivity is greater for binocu- 
lar vision, as has been shown, for 
example, by Fallert (2). However, as 
pointed out in studies of visual acuity 
and other visual functions, the inferi- 
ority of monocular vision may result 
not from the use of one eye as such 
but from the introduction of rivalry, 
intensity, and other effects by the 
very act of closing or occluding the 
other eye (1,4). In any event, the 
results of greater binocular sensitivity 
to this kind of apparent movement do 
not necessarily make it less under- 
standable that the illusory effect is 
best observed under monocular re- 
gard. 


SUMMARY AND CONCLUSIONS 


An experiment was carried out to 
determine whether the sensitivity to 
apparent movement in depth varied 
with the apparent tridimensionality 
of objects viewed monocularly and 
binocularly. 

An analysis of variance of the re- 
sults, supplemented by statements 
from the Ss, indicated the following 
conclusions. 


1. No significant variance in reac- 
tion times was found to exist among 
the test objects used. Apparent tri- 
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dimensionality of an object had no 
significant effect on the time it took 
S to respond to apparent movement 
in depth. This result was discussed 
from the standpoint of the effect of 
object properties on perception and 
an empiricistic view of perception. 

2. Viewing condition (monocular 
and binocular) introduced significant 
variation in the data. The reaction 
times to movement were consistently 
faster for the binocular condition. 

3. An attempt was made to inter- 
pret the present results in light of 
other experimental results which indi- 
cate an effect on movement percep- 
tion of such a factor as apparent di- 
mensionality. In addition, an at- 
tempt was made to incorporate the 
results regarding the effect of viewing 
conditions with other pertinent bin- 
ocular-monocular data. 

4. Although no significant inter- 
actions existed in the data, marked 
individual differences in reaction times 
to this kind of apparent movement 
were found. 


(Manuscript received for priority 
publication October 15, 1951) 
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THE ROLE OF REPETITION RATE AND INTER-STIMULUS 
INTERVAL IN CONTEXT EFFECTS! 


J. M. DOUGHTY 
Franklin and Marshall College 


That psychophysical judgments are 
seriously influenced by the context in 
which they are made is a well-estab- 
lished fact. That the constant error 
(CE) can be moved about at will even 
with keen and experienced Os by ma- 
nipulating the context is known (1, 4). 
Current interpretations of these con- 
text effects are couched in terms of 
interaction effects among the stimuli 
making up a particular experimental 
series (2,3). That is, it is thought 
that stimulation that has previously 
occurred in some way interacts with 
subsequent stimulation so that O 
seems to be comparing the present 
comparison tone not with the actual 
standard but with some other stand- 
ard that approximates the mean of the 
series. If this interpretation is cor- 
rect then it is to be expected that con- 
text effects will vary as the temporal 
relations between the stimuli of the 
series are changed. For example, a 
previous high pitch will contrast less 
sharply with a subsequent low pitch 
if the time interval is lengthened con- 
siderably. The two ways in which the 
temporal separation of stimuli in the 
psychophysical methods may be ac- 
complished is by changing the rate at 
which the stimuli are presented and 
altering the separation between the 
standard and comparison stimuli. 
One may hold repetition rate constant 
and increase the separation between 
the paired standard and comparison, 


1 This research was carried out under terms of 
a contract between Special Devices Center, Office 
of Naval Research, and The Johns Hopkins 
University, Contract N5-ori-166, Task Order I. 
This is Report No. 166-I-135, Project Designa- 
tion No. NR-784-001 under that contract. 


or one may hold the separation be- 
tween standard and comparison con- 
stant and change the repetition rate. 

The purpose of this experiment was 
to explore the role of the temporal 
separation of stimuli on context ef- 
fects. To this end repetition rate 
and inter-stimulus interval (the sepa- 
ration between standard and compari- 
son stimuli) were varied in the meth- 
ods of constant stimuli and average 
error. CE’s and CL’s for pitch were 
determined in a balanced and unbal- 
anced context to see if they varied in 
any systematic way with repetition 
rate and inter-stimulus interval. 

As a point of comparison CE’s for 
pitch were also determined by the 
method of single stimuli. It was ex- 
pected that the maximum effect of 
the unbalanced context would be dis- 
played in this method since the stand- 
ard stimulus is sounded only once at 
the beginning of a run. Thus, the 
context within which O makes his 
judgments of pitch is composed almost 
exclusively of the comparison stimuli. 
The modifying influence of the stand- 
ard tone (repeated for each judgment 
in the method of constant stimuli) is 
absent. 


DESCRIPTION OF THE EXPERIMENT 
A pparatus 


The apparatus used in this experiment was 
identical to that described in detail in a previous 
paper (1). 


Procedure 


Method of constant stimuli—The O was 
seated in the sound-deadened room and asked to 
adjust the headphone so that it fitted comfort- 
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ably. He was instructed to use the same ear at 
all times. He was told to listen to the standard 
stimulus and to judge whether the comparison 
tone was higher or lower in pitch. He was told 
to record this judgment in the appropriate place 
on the tally sheet. 

In a single run O made 70 such judgments; 
ten judgments per point at seven points. In 
all, there were five runs for each O for each set 
of conditions, making a total of 50 judgments 
per point per O. 

Method of average error.—The procedure was 
the same as that for the method of constant 
stimuli except, of course, that O adjusted the 
variable stimulus until the two tones were 
matched in pitch. One judgment was made at a 
time and a new set of conditions was set in be- 
fore O was asked to make another judgment. 
Between judgments he removed the earphones. 
Sets of about 12 such judgments were regularly 
alternated with the runs under the method of 
constant stimuli. Each O made a total of 30 
judgments per point. 

The E used the fine vernier adjustment on the 
beat-frequency oscillator in combination with 
the oscillograph in determining the O’s setting 
for his pitch matches. These were recorded in 
terms of the frequency to match the standard. 


Experimental Variables 


Degree of asymmetry: Method of constant 
stimuli.—This refers to the degree of skewness in 
the distribution of the comparison stimuli about 
the standard. There were two degrees of asym- 
metry defined in terms of the number of com- 
parison stimuli higher than, equal to, and lower 
than the standard stimulus. Zero asymmetry 
had three higher, one equal, and three lower; 
+2 asymmetry had five higher, one equal, and 
one lower. 

Degree of asymmetry: Method of average error. 
—For this method, there were two degrees of 
asymmetry in the stimulus context, defined as 
follows: (0)}—The O was started at a high or low 
point and was instructed to bracket the standard 
as symmetrically as possible in making the 
match; (+ )—the O was started at a point higher 
than the standard and was instructed to change 
the variable in one direction only until the match 
was made. 

Comparison stimulus durations —Comparison 
stimulus durations of 50 msec. and 25 msec. 
were used to provide two levels of difficulty in 
making judgments. 

Repetition rates—Repetition rates of 3, 5, 
10, and 20 per minute were used. 

Inter-stimulus interoals——Temporal separa- 
tions between standard and comparison stimuli 
of 0, .5, 2.0, 4.0, and 8.0 secs. were used. 
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Foreknowledge.—In this condition, O was told 
what was the distribution of comparison stimuli 
with respect to the standard stimulus; e.g., O 
was told that there were five frequencies higher 
than the standard for every one equal to or 
lower than the standard. 

Experimental constants —For all judgments 
the following were constant: (a) The duration 
of the standard tone at 1000 msec. (b) The 
frequency of the standard tone at 1000 cps. 
(c) The intensity of the standard tone at 80 db 
re .0002 dyne/cm.? for two Os and 60 db re .0002 
dyne/cm.? for one O. (d) The mode of listening 
as monaural, 

Training procedure——Two Os were used 
throughout the experiment. They were both 
males about 27 years of age. A third O, female 
and in her early twenties, was used in about half 
the experiment. 

Each O was given a period of training in mak- 
ing pitch judgments. At the same time they 
were gaining familiarity with the procedures for 
the experimental methods we used. This in- 
cluded experience with both comparison tone 
durations and with varying degrees of asym- 
metry. This period ended, and the actual ex- 
periment began for each O when he had reached 
the point of maximum discrimination, as defined 
by no further improvement. 

During the training the Os were repeatedly 
instructed not to be concerned with the distribu- 
tion of their judgments (the number of “highs” 
or “lows,” for instance) but to make all their 
judgments only with respect to the relation be- 
tween each standard-comparison pair as it came 
along. 

In the training period for the method of 
average error, and throughout the experiment, 
the Os were told that their primary task was to 
make the equation in pitch. This task was to 
take precedence over moving the variable stim- 
ulus in one direction only when they were judg- 
ing under the conditions of asymmetry outlined 
above for this method. 

Analysis of data.—For the method of average 
error data, CE’s were computed from the mean 
of the distribution of judgments for each of the 
Os. The DL was taken as the PE of the dis- 
tribution of each O’s judgments. An average of 
the individual CE’s was computed for the pres- 
entation of the results in graphical form. 

The least squares solution for the phi-gamma 
function, with unweighted values, was used to 
determine the CE’s and DL’s from the responses 
of the Os in the method of constant stimuli. 
The CE was taken as the 50% point and the 
DL as one-half the interval between the 25% 
and 75% points. Each O’s data were analyzed 
in this way and an average computed for pres- 
entation in graphs. 
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RESULTS 
Effect of Repetition Rate 


Figures 1 and 2 show the magnitude 
of the influence of an asymmetrical 
context on CE’s for pitch as a func- 
tion of the rate of presentation of the 
stimulus pairs. Figure 1 shows this 
relation for the methods of constant 
stimuli (solid line) and single stimuli 
(broken line) and for the method of 
constant stimuli with foreknowledge 
of the asymmetry in the context 
(dotted line). Figure 2 shows this 
relation for the method of average 
error. 

The points on Fig. 1 are based on 
the judgments of two Os and those in 
Fig. 2 on three Os. They were ob- 
tained by subtracting the CE in 
cycles per second at zero asymmetry 
from the CE at +2 asymmetry. 

In Fig. 1 some indication of inter- 
observer variability is shown by the 
vertical lines passing through the 
points on the graph. The terminal 
points of these vertical lines represent 
the results obtained from the indi- 
vidual Os. For example, one O’s 
results are indicated by the small 
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Fic. 1. The magnitude of the influence of an 
asymmetrical context (+2) on CE’s for pitch as 
a function of repetition rate. The CE’s are for 
a comparison tone of 50 msec. and an interval of 
500 msec. between standard and comparison. 


Each curve is for a different method as indicated..- 


The points are based on the judgments of two Os, 
whose individual results are shown by the 
terminal points of the vertical lines. 
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Fic. 2. The magnitude of the influence of 
an asymmetrical context (+2) on CE’s for pitch 
determined by the method of average error, as a 
function of repetition rate. The CE’s are for a 
comparison tone of 50 msec. and an interval of 
zero second between standard and comparison. 
The points are based on the judgments of three 
Os. The size of the SD of the Os’ judgments is 
indicated by the vertical lines. 


horizontal line drawn through one of 
the terminal points of the vertical line. 
In Fig. 2 the size of the SD of the Os’ 
judgments is indicated by the vertical 
lines. 

From these figures several conclu- 
sions may be drawn: 


1. The CE for pitch as determined 
by the method of constant stimuli is 
profoundly affected by asymmetry in 
the context at all rates of presentation. 
This is true when the stimuli in the 
context are presented very slowly (3 
per min.) as well as when they are 
presented very rapidly (20 per min.). 

2. The effect of asymmetry on the 
CE for pitch appears to be as great 
in the method of constant stimuli as 
in the method of single stimuli. The 
points on the dotted line are not reli- 
ably different from those on the solid 
line in Fig. 1. 

3. The effect of the asymmetry in 
the context on the CE for pitch is con- 
stant throughout the range of repeti- 
tion rates used in the method of con- 
stant stimuli and method of single 
stimuli. None of the points on the 
individual curves in Fig. 1 is reliably 
different from any other. 
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4. In the method of average error, 
the effect of the asymmetry in the 
context is less at the fast repetition 
rate (20 per min.) than at the slow 
repetition rate (3 per min). 

5. There is a significant effect of 
the asymmetry in the context on the 
CE for pitch even when there is fore- 
knowledge of the unbalanced nature 
of the context. The magnitude of 
the effect, however, is significantly 
less with foreknowledge than without. 


Effect of Inter-stimulus Interval 


Figure 3 shows how the effect of an 
asymmetrical context on the CE for 
pitch varies with the interval between 
standard and comparison stimulus 
(inter-stimulus interval). The solid 
line shows the results obtained with 
the method of constant stimuli at two 
repetition rates (3 and 5 per min.) 
and the broken line shows the results 
obtained with the method of average 
error at a repetition rate of 3 per min. 
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Fic. 3. The magnitude of the influence of 
an asymmetrical context on CE’s for pitch as a 
function of inter-stimulus interval. This rela- 
tion is shown at two repetition rates for the 
method of constant stimuli (3 and 5 per min. as 
indicated in the solid curves) and at one repeti- 
tion rate for the method of average error (3 per 
min.). The comparison tone duration was 50 
msec. in all cases. The points on the solid line 
are based on the judgments of two Os; those on 
the broken line on the judgments of three Os. 
The open square represents the data from the 
method of single stimuli obtained at a repetition 
rate of 3 per min. 





The solid curve (method of con- 
stant stimuli) is based on the judg- 
ments of two Os. The broken curve 
(method of average error) is based on 
the judgments of three Os. 

The conclusions that can be drawn 
from the data as displayed in Fig. 3 
are similar for both methods: 


1. The effect of the asymmetrical 
context is significant at all inter- 
stimulus intervals. 

2. There is no systematic change in 
the influence of the asymmetrical 
context with changes in the inter- 
stimulus interval. The influence of 
context appears to be constant 
throughout the range of intervals 
used in this experiment. 


Method of single stimuli—The 
greatest separation between standard 
and comparison stimuli in this ex- 
periment.is actually to be found in 
the method of single stimuli. For 
this reason, data obtained by this 
method have been entered in Fig. 3 
in the form of the single point indi- 
cated by the open square. This 
point was extracted from Fig. 1 show- 
ing the shift in the CE resulting from 
the asymmetrical context as it occurs 
at a repetition rate of 3 per min. 

The point represented by the open 
square lies above the other curves in 
the graph but is not reliably different 
from them. This result tends to con- 
firm the conclusion that the effect of 
context is a constant one for all 
degrees of separation between stand- 
ard and comparison stimuli. 


Discussion 


As was pointed out in the introduc- 
tory section of this paper, changing 
the temporal relations between the 
stimuli in the series should alter the 
supposed interaction between them 
and, consequently, the effect of con- 
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text. The fact that the effect of 
context on the CE for pitch is con- 
stant throughout the range of repeti- 
tion rates and inter-stimulus intervals 
used in this experiment argues against 
the view that context effects are ex- 
plicable solely in terms of interaction 
effects among the stimuli. 

On the other hand, such results as 
these are consistent with the view that 
the O has a tendency to split his judg- 
ments 50—50 between the two alterna- 
tive responses and that this tendency 
can account for context effects. With 
such a tendency remaining constant, 
it is to be expected that the effect of 
asymmetry in the context should be a 
constant one. 

Do context effects disappear when 
such a tendency is removed? Data 
relevant to this question are provided 
by the series of judgments made at 
the end of the experiment with fore- 
knowledge of the asymmetry. The 
results in Fig. 1, dotted line, show that 
even with this foreknowledge there 
still remains some effect of context. 
Is this residual effect due to interaction 
effects? An affirmative answer can- 
not be given to this question because 
it is conceivable that the tendency to 
split the judgments 50—50 has not been 
eliminated by foreknowledge. The 
O knows that there are some stimuli 
lower in frequency than the standard. 
He has been shown that. When he is 
in doubt on difficult discriminations 
and when he has already made a great 
number of “higher” judgments, he 
may still have a tendency to call the 
difficult ones “lower” more often than 
he would in a balanced context. 

Two other facts should be borne in 
mind in considering the interpretation 
of context effects: (a) There appears 
to be a reliable trend toward increas- 
ing DL’s with decreasing repetition 
rate; (b) The DL was significantly 
larger for the inter-stimulus interval of 


zero second than for any of the other 
intervals used. (The trend with in- 
creasing inter-stimulus intervals be- 
yond zero second was unreliable.) 
Both of these facts are hard to recon- 
cile with the view that context effects 
are explicable solely in terms of the 
tendency on the part of the O to split 
his judgments 50-50. As was pointed 
out in a previous paper (1) such tend- 
ency should be greater when the dis- 
criminations are more difficult. The 
larger DL’s indicate a greater diffi- 
culty in making pitch discriminations. 
There should be, then, a direct rela- 
tion between the magnitude of the 
effect of the context on the CE and 
the size of the DL. This, of course, 
is not the case. 

On the other hand, the fact that 
there appears to be a reliable trend 
toward increasing DL’s with decreas- 
ing repetition rate is more in line with 
the view that the influence of context 
is due to interaction effects among the 
stimuli in the series. The O responds 
as though a new standard, approxi- 
mating the mean of the series, is de- 
veloped in the course of the run as a 
result of these interactions. This 
new point of reference has to act over 
a longer period of time for the slower 
repetition rates, and decreasing acuity 
of judgment is to be expected under 
such circumstances. How the rela- 
tion between size of DL and inter- 
stimulus interval fits into such a 
scheme is difficult to say because of 
the unreliability of the results on this 
point. 

In the light of this discussion, no 
conclusions can be drawn concerning 
the interpretation of these context 
effects other than that neither one of 
the two views expressed here is ade- 
quate to account for all the results 
obtained. 

Method of average error—The data 
displayed in Fig. 2 show that context 
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is less effective when the repetition 
rate is fast than when it is slow. It 
has been pointed out (1) that the ap- 
parent context effects obtained in the 
method of average error may be ex- 
plicable in terms of an adjustment 
tendency. The O tends to move the 
variable stimulus up to the edge of the 
interval of uncertainty and to stop 
there when he is approaching the 
standard from one direction (the con- 
dition of asymmetry in this study). 
This factor plus the tedium of making 
pitch matchings when the repetition 
rate is very slow (3 per min.) may 
result in a greater degree of under- 
shooting at the slow rate than at the 
fast rate (where there is, presumably, 
less tedium). This would account 
for the results obtained. 


SUMMARY AND CONCLUSIONS 


This experiment was designed to 
explore the variables of repetition rate 
and inter-stimulus interval in relation 
to the effect of context on judgments 
of pitch. The difference in the CE 
obtained in a balanced context and 
the CE obtained in an unbalanced 
context was determined under several 
rates of presentation (from 3 to 20 
per min.) and several time intervals 
between standard and comparison 
stimuli (ranging from zero to 8.0 sec.). 
Under these conditions DL’s were 


also computed. These variables were 
explored in the method of average 
error, method of single stimuli, and 
method of constant stimuli. The 
following conclusions were reached: 


1. The effect of context is constant 
over the range of repetition rates and 
inter-stimulus intervals used in this 
experiment in both the methods of 
constant stimuli and single stimuli. 

2. The apparent decrease in the 
effect of context at faster repetition 
rates in the method of average error 
may be interpretable in terms of an 
adjustment tendency on the part of O 
rather than strictly in terms of context 
effects. 


(Manuscript received for priority 
publication October 15, 1951) 
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